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CHAPTER 1 



Introduction 




Constraint satisfaction problems (CSPs) appear in almost every area of theo- 
retical computer science, for instance in artificial intelligence, scheduling, compu- 
tational linguistics, computational biology, verification, and algebraic computation. 
Many computational problems studied in those areas can be modeled by appropriately 
choosing a set of constraint types, the constraint language, that are allowed in the 
input instance of a CSP. In the last decade, huge progress was made to find general 
criteria for constraint languages that imply that the corresponding CSP can be solved 



efficiently 12 56 58 59 61 89 117 



Lately, the complexity of the CSP became a topic that vitalizes the field of uni- 
versal algebra, since it turned out that questions about the computational complexity 
of CSPs translate to important universal-algebraic questions about algebras that can 
be associated to CSPs. This approach is now known as the algebraic approach to 
constraint satisfaction complexity. The algebraic approach has raised questions that 
are of central importance in universal algebra. 

Another reason why the complexity of CSPs attracts attention is an exciting con- 
jecture due to Feder and Vardi [89 , which is still unresolved, and which is known 
as the dichotomy conjecture. This conjecture says that every CSP with a finite do- 
main is either polynomial-time tractable (i.e., in P) or NP-complete. According to 
a well-known result by Ladner, it is known that there are NP- intermediate compu- 
tational problems, i.e., problems in NP that are neither tractable nor NP-complcte 
(unless P=NP). But the known NP-intermediate problems are extremely artificial. It 
would be interesting from a complexity theoretic perspective to discover more natural 
candidates for NP-intermediate problems. Unlike many questions in computational 
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complexity that are wide open, the dichotomy conjecture allows many promising par- 
tial results and different approaches (see the collection of survey articles in |75| ), and 
therefore is an attractive research topic. 

Any outcome of the dichotomy conjecture is significant: a negative answer might 
provide relatively natural NP-intermediate problems, which would be interesting for 
complexity theorists. A positive answer probably comes with a criterion which de- 
scribes the NP-hard CSPs (and it would probably even provide algorithms for the 
polynomial-time tractable CSPs). But then we would have a fascinatingly rich cata- 
logue of computational problems where the computational complexity is known. Such 
a catalogue would be a valuable tool for deciding the complexity of computational 
problems: since CSPs are abundant, one might derive algorithmic results by reducing 
the problem of interest to a known tractable CSP, and one might derive hardness 
results by reducing a known NP-hard CSP to the problem of interest. 

Even though very powerful partial results on the dichotomy conjecture have been 
obtained in recent years, the impact of constraint satisfaction complexity theory on 
other fields in theoretical computer science has so far been modest. A reason might 
be that the range of problems in the literature that can be described by specifying a 
constraint language over a finite domain, and that have been studied independently 
from the CSP framework, is quite limited, and mostly focussed on specialized graph 
theoretic problems or Boolean satisfiability problems. 

If we consider the class of all problems that can be formulated by specifying a 
constraint language over an infinite domain, the situation changes drastically. Many 
problems that have been studied independently in temporal reasoning, spatial reason- 
ing, phylogenetic reconstruction, and computational linguistics can be directly formu- 
lated as CSPs. Also feasibility problems in linear (and also non-linear) programming 
(over the rationals, the integers, or other domains) can be cast as CSPs. 

The goal of this thesis is to generalize the universal-algebraic approach to infinite 
domains. It turns out that this is possible when the constraint language, viewed as a 
relational structure 03 with an infinite domain, is lu- categorical. Many of the CSPs in 
the mentioned application areas can be formulated with w-categorical constraint lan- 
guages — in particular, problems coming from so-called qualitative calculi in artificial 
intelligence tend to have formulations with w-categorical constraint languages. While 
w-categoricity is a quite strong assumption from a model-theoretic point of view (and, 
for example, constraint languages for linear programming cannot be w-categorical) , 
the class of computational problems that can be formulated with w-categorical con- 
straint languages is still a very large generalization of the class of CSPs that can 
be formulated with a constraint language over a finite domain. This will be amply 
demonstrated by examples of w-categorical constraint languages from many different 
areas in computer science in Chapter [3j 

There are several general results for w-categorical structures that are relevant 
when studying the computational complexity of the respective CSPs. Every oj- 
categorical structure is homomorphically equivalent to an w-categorical structure 
which is model- complete and a core. Model-complete cores have many good prop- 
erties: for example, those structures have quantifier elimination once expanded by all 
primitive positive definable relations; this is treated in Chapter[3j Since homomorphi- 
cally equivalent structures have the same CSP, we can therefore focus on constraint 
languages that have those properties. 

Moreover, it can be shown that the so-called polymorphism clone of an cj-categorical 
structure 03 fully captures the computational complexity of the corresponding CSP 
(Chapter [5]). By this observation, universal-algebraic techniques can be used to ana- 
lyze the computational complexity of the CSP for 03. Indeed, the study of CSPs has 
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triggered questions that are of central interest in universal algebra, and that have led 
to considerable new activity (see e.g. 13 20 152 180] ). 

Another tool that becomes useful specifically for polymorphisms over infinite 
domains is Ramsey theory (Chapter[8|. The basic idea here is to apply Ramsey theory 
to show that polymorphisms must act canonically on large parts of their domain. 
Typically there only finitely many possibilities for canonical behavior, and so this 
technique allows to perform combinatorial analysis when proving classification results. 
With this approach we can also show that, under further assumptions on 03, many 
questions about the expressive power of 25 become decidable, such as the question 
whether a given quantifier-free first-order formula is in 03 equivalent to a primitive 
positive formula. 

An important feature of the universal-algebraic approach is that tractability of 
a CSP can be linked to the existence of polymorphisms of the constraint language. 
This link can be exploited in several directions: first, when we already know that a 
constraint language of interest has a polymorphism satisfying good properties, then 
this polymorphism can guide the search for an efficient algorithm for the correspond- 
ing CSP. Another direction is that we already have an algorithm (or an algorithmic 
technique), and that we want to know for which CSPs the algorithm is a correct 
decision procedure: again, polymorphisms are the key tool for this task. Finally, we 
might use the absence of polymorphisms with good properties to prove that a CSP 
is NP-hard. There are several instances where these three directions of the algebraic 
approach have been used very successfully for CSPs with finite domain constraint 
languages |12[|59|[68||117] or w-categorical constraint languages [38[|48| . 

In Chapter]9jand Chapter [iO] we use polymorphisms to classify the computational 
complexity in some large families of constraint satisfaction problems. In Chapter[9j wc 
study constraint languages definable over the random graph, and in Chapter |10| con- 
straint languages definable over (Q;<). Even though the two underlying structures 
are very different from a model-theoretic point of view, and even though the classifi- 
cation proofs are very different in both cases, we can give a common formulation of 
the two classification results that delineates also the border between polynomial-time 
solvable and NP-complete CSPs. 



Chapter outline. Constraint satisfaction problems can appear in several differ- 
ent forms, because there are several ways how CSPs can be formalized. The differences 
in formalizing constraint satisfaction problems are related to the way how instances 
are coded and to how the problem itself is described. In the next sections we present 
four formalisms; each of those formalisms is attached to a different line of research. 
In later sections some arguments are more natural from one perspective than from 
the other, so it will be convenient to have them all discussed here. See Figure [O] for 
an illustration how the four perspectives we discuss can be put into relationship to 
each other. 



Perspective 


Instance 


Problem Description 


Homomorphism 
Sentence Evaluation 
Satisfiability 

Existential Second-Order 


Structure 
Sentence 
Sentence 
Structure 


Structure 
Structure 
Sentences 
Sentence 



Figure 1.1. The four perspectives on the definition of CSPs. 
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1.1. The Homomorphism Perspective 

A relational signature r is a set of relation symbols Ri, each of which has an 
associated finite arity ki. A relational structure 21 over the signature r (also called 
t- structure) consists of a set A (the domain or base set) together with a relation 
i? a C A k for each relation symbol R of arity k from r. It causes no harm to allow 
structures whose domain is empty. 

A homomorphism h from a structure 21 with domain A to a structure 05 with 
domain B and the same signature r is a mapping from A to B that preserves each 
relation for the symbols in t; that is, if (oi, . . . , a^) is in then (/i(ai), . . . , h(ak)) 
must be in i? 93 . An isomorphism is a bijective homomorphism h such that the inverse 
mapping h^ 1 : B — > A that sends /i(a;) to x is a homomorphism, too. 

In this thesis, a (non-uniform) constraint satisfaction problem (CSP) is a compu- 
tational problem that is specified by a single structure with a finite relational signa- 
ture, called the template (or the constraint language; the name 'constraint language' 
is typically used in the context of the second perspective on CSPs that we present in 



Section 1.2). 



Definition 1.1.1 (CSP(05)). Let 05 be a (possible infinite) structure with a finite 
relational signature r. Then CSP(05) is the computational problem to decide whether 
a given finite t -structure 21 homomorphically maps to 05. 

CSP(05) can be considered to be a class - the class of all finite r-structures that 
homomorphically map to 05. 

A homomorphism from a given r-structure 21 to 05 is called a solution of 21 
for CSP(05). It is in general not clear how to represent solutions for CSP(05) on 
a computer; however, for the definition of the problem CSP(05) we do not need 
to represent solutions, since we only have to decide the existence of solutions. To 
represent an input structure 21 of CSP(05) we can fix any representation of the relation 
symbols in the signature r, due to the assumption that r is finite. Thus, CSP(05) is a 
well-defined computational problem for any infinite structure 05 with finite relational 
signature. 

Example 1.1.2 (Digraph acyclicity). Next, consider the problem CSP((Z;<)). 
Here, the relation < denotes the strict linear order of the rational numbers Q. An 
instance 21 of this problem can be viewed as a directed graph (also called digraph) , 
potentially with loops. It is easy to see that 21 homomorphically maps to (Z; <) if 
and only if there is no directed cycle in 21 (loops are considered to be directed cycles, 
too). Again, it is easy to see and well-known that this can be tested in linear time, 
for example by performing a depth-first search on the digraph 21. □ 



Example 1.1.3 (Betweenness) . The so-called betweenness problem 163 can be 
modeled as CSP((Z; Betw)) where Betw is the ternary relation 

{(x, y,z)eZ 3 \(x<y<z)\/(z<y<x)}. 

This problem is one of the NP-complete problems listed in the book of Garey and 



Johnson 95 . □ 



Example 1.1.4 (Cyclic-Ordering). The Cyclic- order problem 93 can be modeled 
as CSP((Z; Cycl)) where Cycl is the ternary relation 

{(x, y,z)&Z 3 \(x<y<z)V(y<z<x)V(z<x<y)}. 

This problem is again NP-complete and can be found in [95] . □ 

Example 1.1.5 (fj-coloring problems). Let be an (undirected) graph. We view 
undirected graphs as r-structures where r contains a single binary relation symbol E, 
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which denotes a symmetric and anti- reflexive relation. Then the ^-coloring problem 
is the computational problem to decide for a given finite graph © whether there exists 
a homomorphism from © to f). For instance, if fj is the graph K$ (the complete graph 
on three vertices), then the ij-coloring problem is the famous 3-colorability problem 
(see e.g. |95| ). Similarly, for every fixed k, the fc-colorability problem can be modeled 
as CSP(-f)), for an appropriate graph f). □ 



The next lemma (Lemma 1.1.71 is a useful test to determine whether a compu- 
tational problem can be formulated as CSP(03) for an infinite relational structure 03. 
An (induced) substructure of a r-structure 21 is a r-structure 03 with B C A and 
i? 33 = i? 21 n B n for each n-ary R G r; we also say that 03 is induced by B in 21, and 
write 2t[£?] for 03. The union of two r-structures 21, 03 is the r-structure 21 U 03 with 
domain A U B and relations R* u ® = R % U i? s for all R G r. A disjoint union of 21 
and 03 is the union of isomorphic copies of 21 and 03 with disjoint domains. As disjoint 
unions are unique up to isomorphism, we usually speak of the disjoint union of 21 and 
03, and denote it by 21 l+J 03. The disjoint union of a set of r-structures C is defined 
analogously (and the disjoint union of an empty set of structures is the r-structure 
with empty domain). A structure is called connected if it is not the disjoint union 
of two non-empty structures. A maximal connected substructure of 03 is called a 
connected component of 03. 

Definition 1.1.6. We say that a class C of relational structures is 

• closed under homomorphisms iff whenever 21 G C and 21 homomorphically 
maps to 03 then 03 G C; 

• closed under inverse homomorphisms iff whenever 03 G C and 21 homomor- 
phically maps to 03 then 21 G C; 

• closed under (finite) disjoint unions iff whenever 21, 03 G C then the disjoint 
union of$i and 03 is also in C. 

Note that a class C of r-structures is closed under inverse homomorphisms if and 
only if its complement in the class of all r-structures is closed under homomorphisms. 
When a class is closed under inverse homomorphisms, or closed under homomor- 
phisms, it is in particular closed under isomorphisms. The following is a simple, but 
fundamental lemma for CSPs. When Af is a class of r-structures, we say that a struc- 
ture 21 is AJ -free if no 03 G Af homomorphically maps to 21. The class of all finite 
Af- free structures we denote by Forb(A/"). 

Lemma 1.1.7. Let r be a finite relational signature, and C a class of finite r- 
structures. Then the following are equivalent. 

(1) C = CSP(03) for some r-structure 03. 

(2) C = Forb(AT) for a class of finite connected r-structures Af. 

(3) C is closed under disjoint unions and inverse homomorphisms. 

(4) C = CSP(03) for a countably infinite r-structure 03. 

Proof. It suffices to prove the implications (1) =>■ (2) =>■ (3) => (4). For the im- 
plication from (1) to (2), let Af be the class of all finite connected r-structures that do 
not homomorphically map to 03. Then by transitivity of the homomorphism relation, 
a r-structure 21 homomorphically maps to 03 if and only if no € G Af homomorphically 
maps to 21. 

(2) implies (3). Suppose (2), and let 2li and 2t 2 be two structures from Forb(A/"). 
If there were a homomorphism from one of the structures € £ Af into 2li W 2I2, then 
because £ is connected, it must already be a homomorphism into 2ti or 2L2, which is 
impossible. Hence, Forb(A/") is closed under disjoint unions. Closure under inverse 
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Triangle- Freeness 

INSTANCE: An undirected graph © 
QUESTION: Is © triangle-free? 



Acyclic-Bipartition 

INSTANCE: A digraph © 

QUESTION: Is there a partition V = V 1 W V 2 of the vertices V of © such that 
both ©[Vi] and <5[V 2 ] are acyclic? 

No-Mono-Tri 

INSTANCE: An undirected graph © 

QUESTION: Is there a partition V = V\ W V 2 of the vertices V of © such that 
both ©[Vi] and ©[V2] are triangle- free? 



FIGURE 1.2. Three computational problems that are closed under 
disjoint unions and inverse homomorphisms. 

homomorphisms follows straightforwardly from transitivity of the homomorphism re- 
lation. 

(3) implies (4). Suppose that C is a class of relational structures that is closed 
under disjoint unions and inverse homomorphisms. Let C be a subclass of C where 
we select one structure from each isomorphism class of structures in C. Let 03 be the 
(countably infinite) disjoint union over all structures in C (if C is empty then 03 is 
by definition the empty structure^) . Clearly, every structure in C homomorphically 
maps to 05. Now, let 21 be a finite structure with a homomorphism h to 03. By 
construction of 05, the set h(A) is contained in the disjoint union £ of a finite set 
of structures from C. Since C is closed under disjoint unions, £ is in C. Clearly, 21 
homomorphically maps to £, and because C is closed under inverse homomorphisms, 
21 is in C as well. □ 



Example 1.1.8. The computational problems in Figure 1.2 are closed under 
disjoint unions and inverse homomorphisms. Hence, Lemma |1.1.7| shows that they 
can be formulated as CSP(05) for some relational structure 05. It is easy to see that 
none of those three problems can be formulated as CSP(05) for a finite structure 03. 

We verify this for the problem of Triangle-freeness. For a fixed n, consider the 
graph that contains vertices x\, . . . ,x n , and that contains for every pair i,j with 1 < 
i < j < n two additional vertices Uij,Vij and the edges (xi, Uij), (uij,Vij), (vij,Xj). 
The resulting graph is clearly triangle-free. But note that every homomorphism / from 
this graph to a graph fj with strictly less than n vertices must identify at least two of 
the vertices x\, . . . ,x n . So suppose that f(x{) = f(xj). Because / is a homomorphism, 
we have that (f(xi), f(Ui,j)), {f(ui,j,f{vi,j)), {f(Vi,j),f(xj)) are edges in ft. Hence, fj 
either contains a triangle or a loop. In both cases, cannot be the template for the 
Triangle-Freeness. Hence we have ruled out all templates of size n—1. This concludes 
the proof since n was chosen arbitrarily. □ 

We close with an important concept for finite structures 03, the notion of core 
structures; generalizations to infinite structures 03 are presented in Section [3.6.3| Two 
structures 21 and 03 are called homomorphically equivalent if there exists a homomor- 
phism from 21 to 03 and vice versa. An embedding of 21 into 05 is an injective map 



^Structures with an empty domain are often forbidden in model theory. Lemma 
the places that motivates our decision to allow them in this text. 
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is one of 
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/ : A — > B such that (a 1: . . . , a/-) is in i? a if and only if (/(ai), . . . , /(a^)) is in 
An endomorphism of a structure 05 is a homomorphism from 05 to 03. 

Definition 1.1.9. A structure 05 is a core if all its endomorphisms are embed- 
ding^ For structures 21, 05 of the same signature, the structure 05 is called a core of 
21 if 05 is a core and homomorphically equivalent to 21. 

In fact, we speak of the core of a finite structure 21, due to the following fact, 
whose proof is easy and left to the reader. 

Proposition 1.1.10. Every finite structure 21 has a core. All cores of % are 
isomorphic. 

Core structures 03 have many pleasant properties when it comes to studying 



the computational complexity of CSP(05) (see for instance Proposition 1.2.9 below). 
Clearly, when 21 and 05 are homomorphically equivalent, then CSP(2l) = CSP(03). 
Therefore, and because of Proposition [LLlOj we can assume without loss of generality 
that a finite structure 03 is a core when studying CSP(05). We finally remark that 
structures with a one-element core have a trivial CSP. 

Proposition 1.1.11. Let 03 be a relational structure with a finite relational sig- 
nature and a one-element core. Then CSP(03) is in P. 

Proof. Let € be the core of 03, and let c be the unique element of £. The 
problem CSP(05) can be solved as follows. Let 21 be an input structure of CSP(05). If 
there is (t±, . . . , t n ) € i? a such that (c, . . . , c) ^ then reject. Otherwise accept. □ 

1.2. The Sentence Evaluation Perspective 

Let t be a relational signature. A first-order r-formula 4>{x\, . . . ,x n ) is called 
primitive positive if it is of the form 

where ipi, . . . , tpi are atomic r -formulas, i.e., formulas of the form R(yi, . . . , yk) with 
R e t and yi 6 {x\, . . . , x m }, of the form y = y' for y, y' £ {x\, . . . , x m }, or _L (for 
false). As usual, formulas without free variables are called sentences. Note that we 
do not need a symbol T for true since we can use the primitive positive sentence 
3x. x = x to express it; nonetheless, we sometimes use T as a shortcut for 3x. x = x. 
From the model-checking perspective, CSPs are defined as follows. We will see 



(in Propositions 1.2.4 and 1.2.5) that this definition is essentially the same definition 



as Definition 11.1.1 and that the differences are a matter of formalization. 



Definition 1.2.1. Let 05 be a (possibly infinite) structure with a finite relational 
signature r. Then CSP(05) is the computational problem to decide whether a given 
primitive positive t -sentence <j) is true in 03. 

The given primitive positive r-sentence 4> is also called an instance of CSP(03). 
The conjuncts of an instance <fi are called the constraints of <j). A mapping from the 
variables of <fi to the elements of B that is a satisfying assignment for the quantifier- free 
part of <fi is also called a solution to <j). 

Some authors omit the (existential) quantifier-prefix in instances <p of CSP(03), 
and the question is then whether <fi is satisfiable over 05. Clearly, this is just re- 
phrasing the problem above, but it explains the terminology of satisfiable and unsat- 
isfiable (rather than true and false) instances of CSP(05). 



2 For finite structures 03, injective self-maps must be bijective, and in fact every injective homo- 
morphism of a structure 93 must be an isomorphism. For infinite structures, however, this need not 
be true, and for reasons that become clear in Chapter jS] we chose the present definition. 
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3SAT 

INSTANCE: A prepositional formula in conjunctive normal form (CNF) with 
at most three literals per clause 

QUESTION: Is there a Boolean assignment for the variables such that in each 
clause at least one literal is true? 

Positive l-in-3-3SAT 

INSTANCE: A prepositional 3SAT formula with only positive literals 
QUESTION: Is there a Boolean assignment for the variables such that in each 
clause exactly one literal is true? 

Positive Not-All-Equal-3SAT 

INSTANCE: A propositional 3SAT formula with only positive literals 
QUESTION: Is there a Boolean assignment for the variables such that in each 
clause neither all three literals are true nor all three are false? 



Figure 1.3. Three Boolean satisfiability problems from the list of 



NP-complete problems of 95 that can be formulated as CSP(23) for 
appropriate 93. 

Example 1.2.2 (Boolean satisfiability problems). There are many Boolean sat- 
isfiability problems that can be cast as CSPs. Well-known examples are 3SAT (see 



Figure 1.3), and the restricted versions of 3SAT called l-in-3-3SAT and NOT-ALL- 



EQUAL-3SAT 95 . These three problems are NP-complete. An interesting feature 
of the last two problems is that they remain NP-complete even when all clauses in 
the input only contain positive literals. With this additional restriction, the problems 
are called positive l-in-3-3SAT and positive NOT-ALL-EQUAL-3SAT, and their def- 
inition can be found in Figure |1.3| 

All of these problems can be formulated as CSP(Q5), for an appropriate 2-element 
structure *B. Positive l-in-3-3SAT can be formulated as CSP(Q3) for the template 

<B = ({0, 1}; HNS) where 1IN3 = {(0, 0, 1), (0, 1, 0), (1, 0, 0)} , 

and Positive-Not-All-Equal-3SAT as CSP(93) for the template 

*B = ({0,1} ,NAE) where NAE = {0, l} 3 \ {(0, 0, 0), (1, 1, 1)} . 

These problems can also be formulated as CSPs if we do not impose the restriction 
that all literals are positive; the corresponding problems are then called l-in-3-3SAT 
and Not- All- Equal- 3 SAT, respectively. The idea is to use a different ternary relation 
for each of the eight ways how three distinct variables in a clause with three literals 
might be negated. In this way, we can also model the classical problem of 3SAT 



(again, see Figure 1.3 ) as a CSP. Clauses of the type xW y\/ ~^z in the 3SAT problem 
will then be viewed as constraints i? +H (x, y, z), where i? +H = {0, l} 3 \ {(0, 0, 1)} 
(here, x,y,z are not necessarily distinct variables). Similarly, the well-known 2SAT 
problem can be viewed as CSP(({0, 1}; R++ , R+~ , R~+ , R—)) where 

i? ++ = {(0,l),(l,0),(l,l)}, 

R+- -{(0,0), (1,1), (1,0)}, 

iT+ = {(1,1), (0,0), (0,1)}, and 

R— ={(1,0), (0,1), (0,0)}. 

□ 
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Example 1.2.3 (Disequality constraints). Consider the problem CSP((N; =, ^)). 
An instance of this problem can be viewed as an (existentially quantified) set of 
variables, some linked by equality, some by disequalitjlj constraints. Such an instance 
is false in (N; =, ^f) if and only if there is a path x\, . . . ,x n from a variable x\ to 
a variable x n that uses only equality edges, i.e., l Xi = Xi + \ is a constraint in the 
instance for each 1 < i < n — 1, and additionally l x\ ^ Xn is a constraint in the 
instance. Clearly, it can be tested in linear time in the size of the input instance 
whether the instance contains such a path. □ 

1.2.1. Canonical conjunctive queries. To every relational r-structure 21 we 
can associate a r-sentence, called the canonical conjunctive query of 21, and denoted by 
Q(2l). The variables of this sentence are the elements of 21, all of which are existentially 
quantified in the quantifier prefix of the formula, which is followed by the conjunction 
of all formulas of the form R(a\, . . . , a,k) for R € r and tuples (di, . . . , dfc) € i? a . 

For example, the canonical conjunctive query (^(K^) of the complete graph on 
three vertices K$ is the formula 

3u3v3w (E(u, v) A E(v, u) A E(v, w) A E(w, v) A E(u, w) A E(w, u)) . 

The proof of the following proposition is straightforward. 

Proposition 1.2.4. Let 03 be a structure with finite relational signature t, and 
let 21 be a finite r-structure. Then there is a homomorphism from 21 to 03 if and only 
i/Q(2t) is true in 03. 

1.2.2. Canonical databases. To present a converse of Proposition |1.2.4| we 
define the canonical database D((/>) of a primitive positive r-formula, which is a rela- 
tional r-structure defined as follows. We require that <j> is not the formula _L. If <\> 
contains an atomic formula of the form x = y, we remove it from <j), and replace all 
occurrences of x in (f> by y. Repeating this step if necessary, we may assume that 4> 
does not contain atomic formulas of the form x = y. 

Then the domain of D(</)) is the set of variables (both the free variables and the 
existentially quantified variables) that occur in <fi. There is a tuple (v±, . . . , Vk) in a 
relation R of D(</>) iff <fi contains the conjunct R(v±, . . . , Vk)- The following is similarly 



straightforward as Proposition 1.2.4 



Proposition 1.2.5. Let 53 be a structure with signature t, and let (f> be a r- 
sentence other than _L. Then (f> is true in 58 if and only if D(</>) homomorphically 
maps to 03. 



Due to Proposition 1.2.5 and Proposition 1.2.4[ we may freely switch between the 



homomorphism and the logic perspective whenever this is convenient. In particular, 
instances of CSP(Q3) can from now on be either finite structures 21 or primitive positive 
sentences cj). 

1.2.3. Expansions. Let 21 be a r-structure, and let 21' be a r'-structure with 
r C r'. If 21 and 21' have the same domain and i? a = i? a for all R G r, then 21 is 
called the r-reduct (or simply reduct) of 2t', and 21' is called a t' -expansion (or simply 
expansion) of 21. When 21 is a structure, and R is a relation over the domain of 21, 
then we denote the expansion of 21 by R by (21, R) . 

The following lemma says that we can expand structures by primitive positive 
definable relations without changing the complexity of the corresponding CSP. Hence, 
primitive positive definitions are an important tool to prove NP-hardness: to show 

^We deliberately use the word disequality instead of inequality, since we reserve the word in- 
equality for the relation x < y. 
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that CSP(03) is NP-hard, it suffices to show that there is a primitive positive definition 
of a relation R such that CSP((03, R)) is already known to be NP-hard. Stronger tools 
to prove NP-hardness of CSPs will be introduced in Section [575] 



LEMMA 1.2.6. Let 03 be a structure with finite relational signature, and let R be a 
relation that has a primitive positive definition in 03. Then CSP(03) and CSP((03, R)) 
are linear-time equivalent. They are also equivalent under deterministic log-space 
reductions. 

Proof. It is clear that CSP(03) reduces to the new problem. So suppose that 
(j) is an instance of CSP((Q3, R)). Replace each conjunct R{x\, . . . , x{) of <j) by its 
primitive positive definition ip(xi, . . . , x{). Move all quantifiers to the front, such 
that the resulting formula is in prenex normal form and hence primitive positive. 
Finally, equalities can be eliminated one by one: for equality x = y, remove y from 
the quantifier prefix, and replace all remaining occurrences of y by x. Let ip be the 
formula obtained in this way. 

It is straightforward to verify that <j) is true in (03, R) if and only if ?/> is true in 03, 
and it is also clear that ip can be constructed in linear time in the representation size 
of (j). The observation that the reduction is deterministic log-space, we need the recent 



result that undirected reachability can be decided in deterministic log-space [171 . □ 



Example 1.2.7. The relation NAE{x\, X2, x 3 ) has the following primitive positive 
definition in ({0, 1}; 1IN3). 

3ui,u 2 ,u 3 ,vi,V2,v 3 ,zi, z 2 , z 3 (lIN3(xi,ui,vi) A lIN3(x 2 ,u 2 ,v 2 ) A HN3(x 3 ,u 3 ,v 3 ) 

MIN3{v 1 ,u 2 ,z 1 )MIN3(v 2 ,u 3 ,z 2 ) A lIN3(v 3 , Ul , z 3 ) A 1IN3(zi,z 2 ,z 3 )) 

To see that this works, note that when x\ = x 2 = x 3 = 1, then the first three 
conjuncts imply that u\ — V\ = u 2 — v 2 = u 3 = v 3 = 0, and the next three 
conjuncts imply that z\ = Z2 = z 3 = 1, and hence the last conjunct is violated. When 
X\ = x 2 = x 3 = 0, then the first conjunct implies that u\ — and v\ = 1, or u% = 1 
and v\ — 0. In both cases, the fourth conjunct implies that Z\ = 0. Similarly, we can 
infer that Z2 — z 3 = 0. Whence, the last conjunct is violated. 

Now consider the case when exactly one out of X\, x 2 , x 3 is 0. Since the formula 
is symmetric with respect to Xx,x 2 ,x 3 , we assume without loss of generality that 
Xi = 0, x 2 = 1, £3 = 1. Then we can set U\ = z\ = z 2 = 1, and v\ = u 2 = v 2 = u 3 = 
v 3 = z 3 = and satisfy all conjuncts. Similarly, when exactly two out of x±,x 2 , x 3 are 
0, we assume without loss of generality that x± = 1, x 2 = X3 = 0. Then we can set 
ui = vi = u 2 — u 3 = z 2 = z 3 = and Z\ — V2 = v 3 = 1 and satisfy all conjuncts. □ 

An automorphism of a structure 03 with domain B is an isomorphism between 03 
and itself. When applying an automorphism a to elements from B we omit brackets, 
that is, for b g B we write ab instead of a(b). The set of all automorphisms a of 03 
is denoted by Aut(03), and a~ l denotes the inverse map of a. Let (pi, . . . , bk) be a 
fc-tuple of elements of 03. A set of the form S = {(abi, . . . ,abk) | a € Aut(03)} is 
called an orbit of k-tuples (the orbit of (pi, ... , bk))- 

Lemma 1.2.8. Let 03 be a structure with a finite relational signature and do- 
main B, and let R — {(61, . . . , bk)} be a k-ary relation that only contains one tuple 
(61, ... , bk) G B k . If the orbit of (pi, . . . , bk) in 03 is primitive positive definable, then 
there is a polynomial-time reduction from CSP((23,i?)) to CSP(03). 

Proof. Let <j) be an instance of CSP((03,i?)) with variable set V. If </> contains 
two constraints R(xi, . . . , Xk) and R(yi, ■ ■ ■ , yk), then replace each occurrence of yi 
by xi, then each occurrence of y 2 by x 2 , and so on, and finally each occurrence of 
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Uk by Xk- We repeat this step until all constrains that involve R are imposed on the 
same tuple of variables [x\, . . . , Xk). Replace R(x\, . . . , Xk) by the primitive positive 
definition 9 of its orbits in 03. Finally, move all quantifiers to the front, such that 
the resulting formula ip is in prenex normal form and thus an instance of CSP(03). 
Clearly, ip can be computed from <p in polynomial time. We claim that <p is true in 
(03, R) if and only if ip is true in 03. 

Suppose <fi has a solution s : V — > B. Let s' be the restriction of s to the variables 
of V that also appear in (p. Since (b%,... ,b n ) satisfies 9, we can extend s' to the 
existentially quantified variables of 9 to obtain a solution for ip. In the opposite 
direction, suppose that s' is a solution to ip over 03. Let s be the restriction of s' to 
V. Because (s(x\), . . . , s(xk)) satisfies 9 it lies in the same orbit as (b%, . . . , bk). Thus, 
there exists an automorphism a of 03 that maps (s(xi), . . . , s(xf.)) to . . . , bf~). 
Then the extension of the map x \— > as(x) that maps variables yi of <p that have been 
replaced by xi in ip to the value bi is a solution to (p over (03, R). □ 

Recall from Section [l.l| that every finite structure €. is homomorphically equivalent 
to a core structure 03, which is unique up to isomorphism. For core structures, all 
orbits are primitive positive definable. This fact has an elementary proof for finite 
structures 03; however, the same is true for a large class of infinite structures, and 
presented in Chapter[3j Theorem 3.6.11 Since Theorem 3.6.11 implies the following 



proposition, we omit the proof at this point. 

Proposition 1.2.9. Let 03 be a finite core structure. Then orbits of k -tuples of 
03 are primitive positive definable. 

Proposition |1 .2.9] and Lemma [l.2.8| can be combined to the following well-known 
consequence. 

Corollary 1.2.10. Let 03 be a finite core structure with elements b\,...,b n . 
Then CSP(03) and CSP((03, {&i}, . . . , {&«})) are polynomial time equivalent. 

1.3. The Satisfiability Perspective 

Yet another perspective on the constraint satisfaction problem translates not only 
the instances, but also the template of the CSP into logic. This leads to a natural 
perspective for various model-theoretic considerations in Chapter [3j Moreover, this 
perspective is convenient when discussing the literature that uses relation algebras in 



the context of constraint satisfaction 83 135 ; this connection will be described in 
Section ll.3.2l and Section fl. 3.31 

We use the opportunity to introduce some inevitable terminology from logic. We 
assume that the reader is already familiar with basic terminology of first-order logic; 
a highly recommendable text-book is Hodges 



114 



1.3.1. Theories. A (first-order) theory is a set of first-order sentences. When 
the first-order sentences are over the signature r, we also say that T is a r-theory. 
A model of a r-theory T is a r-structure 03 such that 03 satisfies all sentences in T. 
Theories that have a model are called satisfiable. 

Definition 1.3.1. Let r be a finite relational signature, and let T be a r-theory. 
Then CSP(T) is the computational problem to decide for a given primitive positive 
t -sentence <j> whether TL) {<p} is satisfiable. 

The satisfiability perspective on CSPs stresses the fact that the problem CSP (03) 
is fully determined by the first-order theory of 03, that is, by the theory that contains 
exactly those sentences that are true in 03. In fact, it is already determined by the 
primitive positive sentences that are false in 03. 
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Example 1.3.2. Let T be the theory that consists of the following sentences. 

Vx, y, z ((x < y A y < z) — > x < z) (transitivity) 
Vx, y -i{x < x) (irreflexivity) 
Vx, y, z ((x < y) V (y < x) V (x = y)) (totality) 



It is straightforward to verify that CSP(T) equals CSP((Z; <)) (Example 1.1.2). □ 



When T is a theory and <f> a sentence, we say that T entails 4>, in symbols T \= <p, 
if every model of T satisfies cf>. The following is clear from the definitions. 

Proposition 1.3.3. Let t be a finite relational signature, and let T be a r-theory. 
Suppose that T entails exactly those negations of primitive positive sentences <f> such 
that 23 |= <fi. Then CSP(T) and CSP(23) are the same problem. 

We have already seen that two structures that are homomorphically equivalent 
have the same CSP; the following provides a necessary and sufficient condition that 
describes when two theories have the same CSP. Its proof is simple once the relevant 
notions from logic are introduced, and will be given in Section [2.1. 3 1 

Proposition 1.3.4. Let T and T' be two first-order theories. Then the following 
are eguivalent. 

• CSP(T) eguals CSP(T'). 

• Every model of T' has a homomorphism to some model of T , and every 
model of T has a homomorphism to some model of T' . 

• T and T' entail the same negations of primitive positive sentences. 

We now present a couple of basic observations relating the definition of CSP(T) 
for a theory T with the definition of CSP(23) for a relational structure 05. We start 
with the observation that there are theories T such that CSP(T) cannot be formulated 
as CSP(Q3). 

Example 1.3.5. Let r be the signature {R, G}, where R and G are unary relation 
symbols, and let T be the r-theory {Vx, y ->(R(x) A G(y))}. There is no structure 
<B such that CSP( S 8) equals CSP(T). To see this, observe that T U {3x.R(x)} is 
satisfiable, and TU {3x.G(x)} is satisfiable. But any structure 03 that satisfies both 
3x.R(x) and 3x.G(x) also satisfies 3x,y(R(x) AR(y)), which shows that CSP(25) and 
CSP(T) are different. □ 

We next characterize those satisfiable theories T that have a model 23 such that 
CSP(25) and CSP(T) are the same problem. 

Proposition 1.3.6. Let r be a finite relational signature, and letT be a satisfiable 
first-order r-theory. The following are eguivalent. 

(1) There is a structure 25 such that CSP(23) and CSP(T) are the same problem. 

(2) There is a model 25 of T such that CSP(23) and CSP(T) are the same 
problem. 

(3) For all primitive positive r -sentences <f>\ and <p2, if T U {</>i} is satisfiable 
and T U {^2} is satisfiable then T U {<f>i, ^2} is satisfiable as well. 

(4) T has the Joint Homomorphism Property (JHP), that is, when T has models 
21 and 23, then it also has a model £ such that both 21 and 23 homomorphically 
map to €. 



We defer the proof of this (simple) fact to Section 2.1.3 when we have some more 
concepts from logic available. 
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1.3.2. Relation Algebras. Many interesting infinite-domain CSPs, in partic- 
ular in spatial and temporal reasoning, have been studied in the context of relation 

In Artificial 



algebras (many examples will be given in Section 1.5 and Chapter |4|) 
Intelligence, relation algebras are used as a framework to formalize and study qual- 
itative reasoning problems 83||lll[[l35 . From the perspective of this thesis, the 
relation algebra approach does not bring substantially new tools, and Section |1.3.2| 
and Section [l.3.3| can be safely skipped. Here we nonetheless give a quick introduction 
in order to link the relation algebra terminology with the satisfiability perspective on 



the CSP (Section 1.3.3) 



Relation algebras are designed to handle binary relations in an algebraic way; we 
follow the presentation in 



111 



Definition 1.3.7. A proper relation algebra is a domain D together with a set 
B of binary relations over D such that 

(1) Id := {(x,x) \ xeD}eB; 

(2) // Bi and B 2 are from B, then B x V B 2 := B Y U B x e B; 

(3) l:={j ReB ReB; 

(4) € B; 

(5) IfBeB, then -B := 1 \ B x e B; 

(6) IfBeB, then B~ := {(x,y) \ (y,x) G B} € B; 

(7) // B\ and B 2 are from B, then B\ o B 2 G B; where 

B\ o B 2 := {{x, z) | 3y.((x, y) € B x A {y, z) e B 2 )} . 

We want to point out that in this standard definition of proper relation algebras 
it is not required that 1 denotes D 2 (and this will be used for instance in the proof of 
Proposition 1.3.161. However, in most examples that we encounter, 1 indeed denotes 
D 2 . The minimal non-empty elements of B with respect to set-wise inclusion are 
called the basic relations of the relation algebra. 

Example 1.3.8 (The Point Algebra). Let D = Q be the set of rational numbers, 
and consider 

B = {<,>,=,<, >,0,Q 2 }. 
Those relations form a proper relation algebra (with atoms <,>,=, and where 1 
denotes Q 2 ) which is one of the most fundamental relation algebras and known under 
the name point algebra. □ 

When B is finite, every relation in B can be written as a finite union of basic rela- 
tions, and we abuse notation and sometimes write R = {B\ 1 . . . , B^} when B±, . . . , Bk 
are basic relations, R € B, and R = B\ U • • • U B^. Note that composition of basic 
relations determines the composition of all relations in the relation algebra, since 

RioR 2 = |J B x oB 2 . 



An abstract relation algebra (Definition 1.3.9 below) is an algebra with signature 
Id, 0, 1, — , V, o that satisfies laws that we expect from those operators in a proper 
relation algebra. 



Definition 1.3.9 (Compare [83 111 |135| ). An (abstract) relation algebra A is 
an algebra with domain A and signature {V, — , 0, 1, o, w , Id} such that 

• the structure (A; V, A, — , 0, 1) is a Boolean algebra where A is defined by 
(x, y) i y —(—x V —y) from — and V; 

• o is an associative binary operation on A; 

• (a")" = a for all a £ A; 
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FIGURE 1.4. The composition table for the basic relations in the 
point algebra. 

• Ido a = a o Id = a for all a G A; 

• a o (6 V c) = a o b V a o c; 

• (a V b)~ = a~ V 

• (-a)" = -( a ~); 

• (a o by — b" o a"; 

• (a o 6) A c~ = (loc)Aa" = 0. 

We define x<yhyx/\y — x. A subalgebra B of a relation algebra A with 
domain A is a relation algebra with domain B C A such that for every function / of 
A, the element obtained by applying / to elements from B is again in B. 

A representation (D, i) of A consists of a set D and a mapping i from the domain 
A of A to binary relations over D such that the image of i induces a proper relation 
algebra B, and i is an isomorphism with respect to the functions (and constants) 
{V, — , 0, l,o," ,Id}. In this case, we also say that A is the abstract relation algebra 
ofB. 



There are finite relation algebras that do not have a representation 144 . Note 
that when (D, i) is a representation of A, then i(a) is a basic relation of the induced 
proper relation algebra if and only if a ^ 0, and for every b < a we have b = a or 
6 = 0; we call a an atom of A. Using the axioms of relation algebras, it can be shown 
that the composition operator is uniquely determined by the composition operator 
on the atoms. Similarly, the inverse of an element a € A is the disjunction of the 
inverses of all the atoms below a. 

Example 1.3.10. The (abstract) point algebra is a relation algebra with 8 ele- 
ments and 3 atoms, =, <, and >, and can be described as follows. The composition 
operator of the basic relations of the point algebra is shown in the table of Figure [L~4| 
By the observation we just made, this table determines the full composition table. 
The inverse of < is > , and Id denotes = which is its own inverse. This fully determines 
the relation algebra. 

We can obtain a representation of the abstract point algebra from the point 
algebra with domain Q presented in Example |1.3.8| in the obvious way. Note that this 
is not the only representation of the abstract point algebra: another representation 
can be obtained by taking [0, 1] in place of Q. While in any representation the relation 
for < has to be transitive and dense, it need not be unbounded. □ 

1.3.3. Network Satisfaction Problems. The central computational problems 



that have been studied for relation algebras are network satisfaction problems 83|111 



135 , defined as follows. Let A be a finite relation algebra with domain A. An (A.-) 
network N = (V; f) consists of a finite set of nodes V and a function / : V x V ^ A. 
Two types of network satisfaction problems have been studied for A-networks. The 
first is the network satisfaction problem for a (fixed) representation of A, defined as 
follows. 



Definition 1.3.11. Let (D,i) be a representation of a finite relation algebra A. 
Then the network satisfaction problem for (D, i) is the computational problem to 
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decide whether a given A-network N is satisfiable with respect to (D,i), that is, 
whether there exists a mapping s : V — > D such that (s(u), s(v)) £ i(f(u,v)) for all 
u, v £ V . 

The second problem is the (general) network satisfaction problem for A. 

Definition 1.3.12. Let A. be a finite relation algebra. Then the network sat- 
isfaction problem for A is the computational problem to decide whether for a given 
A-network N there exists a representation {D,i) of A such that N is satisfiable with 
respect to (D, i). 

It is not surprising that every network satisfaction problem for a fixed repre- 
sentation is closely related to a corresponding constraint satisfaction problem; this 
correspondence will be described in the following. This also applies to the general 
network satisfaction problem: it turns out that every finite relation algebra A that 
has a representation also has a representation (D,i) such that the general network 
satisfaction problem for A and the network satisfaction problem for (D, i) are one 



and the same problem (Proposition 1.3.16) 



To present the link between network satisfaction problems and CSPs as defined 
earlier we need the following notation. Let ta be a signature consisting of a binary 
relation symbol R a for each element a £ A. When (D, i) is a representation of ta, then 
this gives rise to a TA-structurc %D,i in a natural way: the domain of the structure 
is D, and the relation symbol R a is interpreted by i(a). 

We can associate to each A-network TV = (V; f) a primitive positive TA-sentence 
(j) 7v ) in the following straightforward way: the variables of 4>n are V, and <j) contains 
the conjunct R a (u,v) iff f(u,v) = a. Conversely, we can associate to each primitive 
positive TA-sentence <j> with variables V a network as follows. The domain of is 
V . Let u, v £ V, and list by ai, . . . , ak all those elements a of A such that <f> contains 
the conjunct R a (u, v). Then define f(u, v) = a for a = (ai A a 2 A • • • A a^) (if k = 0, 
then a = by definition). 

The following link between the network satisfaction problem for a fixed represen- 
tation (D,i) of A, and the constraint satisfaction problem for ^Bd.i is straightforward 
from the definitions. 

Proposition 1.3.13. Let A be a finite relation algebra with representation (D,i). 
Then an A-network N is satisfiable with respect to (D,i) if and only if $$0,1 H 4>n- 
Conversely, satisfies a primitive positive Tj^-sentence <fi if and only if TV^ is 

satisfiable with respect to (D,i). 

Proposition |1. 3. T3| shows that network satisfaction problems for fixed representa- 
tions essentially are constraint satisfaction problems, and that the differences are only 
a matter of formalization. To also relate the general network satisfaction problem for 
a finite relation algebra A to a constraint satisfaction problem, we define in Figure [T75| 



the first-order TA-theory Ta (as in 111 , Section 2.3). The models of Ta correspond 



to the representations of A, as described in the following. 

Proposition 1.3.14. Let A be a finite relation algebra. When^B models Ta, then 
(B,i) where B is the domain o/*B and i is given by i(a) = is a representation of 
A. Conversely, for every representation (D,i) of A the T^-structure *Bn,i is a model 

"f I A. 

Proof. The proof is straightforward by matching the sentences in Ta with the 
items of Definition 11.3.71 □ 

Corollary 1.3.15. Let A be a finite relation algebra. Then an A-network N 
is satisfiable if and only if 4>n U Ta is satisfiable. Conversely, when <f> is a primitive 
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T A := {Vx, y.^0(x, y) A (Id(x, y) e> x = y)} (1) 
u{Vx,y.(l(x,y)^ \/ R a (x,y))} (2) 



a£A 



U (J {Vx,y.(R a -(x,y)&R a (y ) x)A(R- a (x,y)-&^R a (x,y))} (3) 
U |J {V.T,y.(i? QV6 (a;,y) ^ (i? a (x,?/) Vi? fc (a;,y)))} (4) 
U (J {Va; ! z.( J R o6(x,2;)^3i/.( J R (ar,y) Ai?6(2/,^)))} (5) 

Figure 1.5. The definition of the TA-theory T A . 

positive Tj^-sentence, then the K-network is satisfiable if and only if <j> U Ta is 
satisfiable. 

It is easy to see that Ta has the Joint Homomorphism Property (JHP, introduced 



in Proposition 2.4.6 1; in fact, the disjoint union of two models of Ta is again a model 
of T"a- 

Proposition 1.3.16. Every finite relation algebra A that has a representation 
also has a representation (D;i) whose network satisfaction problem is the same prob- 
lem as the general network satisfaction problem for A. 

Proof. Since A has a representation, and by Proposition [L3T4] the theory Ta 
is satisfiable. Since Ta also has the JHP, we can apply Proposition |2.4.6| to obtain 
a model 03 of Ta. with domain B be such that CSP(03) and CSP(Ta) are the same 



problem. Then by Proposition 1.3.14 for i given by i(a) — i?®, the relation algebra 
A has the representation (B, i). 

We then have for all A-nctworks N the following equivalences. 



N is satisfiable <4> 4>n U Ta. is satisfiable (Corollary 1.3.15) 



<^ 03 |= (f>N (by the properties of 03) 



4=> 4>n is satisfiable wrt. (B,i) (Proposition 1.3.131 

This concludes the proof that the representation (B, i) of A has a network satisfaction 
problem that equals the general network satisfaction problem for A. □ 

In combination with Proposition |1.3.13[ this implies that also every general net- 
work satisfiability problem is essentially the same problem as a CSP for an infinite 
template. 

We close this section by discussing the weaknesses of the relation algebra approach 
to constraint satisfaction. First of all, the class of problems that can be formulated as 
a network satisfiability problem for finite relation algebra A is severely restricted. The 
relations that we allow in the input network are closed under unions; this introduces 
a sort of restricted disjunction that quickly leads to NP-hardness, and indeed only 
a few exceptional situations have a polynomial-time tractable network satisfiability 
problem [111] . The typical work-around here is to introduce another parameter, 
which is a subset B of the domain of A, and to study the network satisfaction problem 
for networks N — (V; f) where the image of / is contained in B. Such subsets B are 
often called a fragment of A. Note that such an additional parameter is not necessary 
for CSPs as studied in this thesis: with the techniques of this section, we can also 
formulate the network satisfaction problems for fragments of A as CSPs. 



1.4. THE EXISTENTIAL SECOND-ORDER PERSPECTIVE 



17 



Also note that the network satisfaction problem is restricted to binary relations, 
whereas many important CSPs can only be formulated in a natural way with higher- 



ary relations (see e.g. Section 1.5.2 or Section 1.5.8). As we have seen in Proposi- 
tion 1.3.16| every network satisfaction problem can be formulated as CSP(Q3) for an 
appropriate infinite structure 25; but as the above remarks show, only a very small 
fraction of CSPs can be formulated as a network satisfaction problem. Even though 
only very specific CSPs can be formulated as the network satisfaction problem for 
a finite relation algebra A, there are hardly any additional techniques available for 
studying network satisfaction problems. The tools we have for network satisfaction 
usually also apply to constraint satisfaction problems. 

The study of composition of relations in the context of the network satisfiability 
problem is usually justified by the fact that a network with constraints over the 
relation R o S can be simulated by networks that only have constraints over the 
relation R and over the relation S. To study the computational complexity of the 
network satisfaction problem for a fragment B of a relation algebra A, one therefore 
typically computes the closure of B under the operations of the relation algebra. 
But note that every binary relation in the closure of B is also primitive positive 
definable in any representation of A, and that the converse of this statement is false. 
Since the computational complexity is preserved also for expansions by primitive 
positive definable relations (see Lemma 1.2.61, primitive positive definitions therefore 
appear to be the more appropriate tool for studying network satisfaction problems. 
Apart from being more powerful, primitive positive definability has another advantage 
in comparison to closure in relation algebras: while the latter is intricate and not 
well-understood, we can offer a powerful Galois theory to study primitive positive 
definability of relations (see Chapter [5]). 



1.4. The Existential Second-Order Perspective 

By a famous result of Fagin, which will be reviewed below, the complexity class NP 
corresponds exactly to those problems that can be formulated in existential second- 
order logic (ESO). An important fragment of ESO that is particularly natural when it 
comes to the formulation of CSPs is the logic called SNP, introduced by Papadimitriou 
and Yannakakis [165] . An existential second-order sentence is in SNP (for strict NP) 
if its first-order part is universal. There are many links between constraint satisfaction 
and the complexity class SNP; many of those go back to |89| and [90|, some others 
that we present here are new. 

SNP is often a convenient way to specify CSPs. However, not every problem 
in SNP is a CSP. In this section we present a syntactic condition that implies that 
an SNP sentence describes a problem of the form CSP (05) for an infinite structure 
25. Conversely, if an SNP sentence describes a CSP, then there is an equivalent SNP 
sentence that satisfies the syntactic condition. 

The special case in which all existentially quantified relations are unary, known 
as monadic SNP, deserves special attention, and will be discussed at the end of this 
section. 



1.4.1. Fagin's theorem. We start by reviewing Fagin's theorem (see e.g. |85| ). 
Fix a finite relational signature r. Let C be a class of finite r-structures that is closed 
under isomorphisms (that is, if 05 G C, and 21 is isomorphic to 25, then 21 € C). We 
also fix some standard way to code relational structures as finite strings so that they 



can be given as an input to a Turing machine, see again 85 . Wc say that C is in NP 



when there exists a non-deterministic polynomial time algorithm that accepts exactly 
the structures from C under this representation. 
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A sentence of the form 3Ri , . . . , R m . <f> where is a first-order sentence with 
signature r U {Ri, ■ ■ ■ ,R m } is called an existential second-order sentence. When a 
structure 21 satisfies $ (and this is defined in the obvious way, see e.g. [85| ), we write 
21 |= 

Theorem 1.4.1 (Fagin's Theorem, see e.g. [ 85 ) . An isomorphism-closed class 
of finite t -structures is in NP if and only if there exists an existential second-order 
sentence $ that describes C in the sense that 

21 G C if and only if 2t |= <& . 

1.4.2. SNP. An SNP sentence is an existential second-order sentence with a 
universal first-order part, i.e., a sentence of the form 

3R 1 ,...,R k .Vx 1 ,...,x n . (j) 

where (j> is quantifier-free and over the signature r U {Ri, ■ . ■ , Rk}- The class of 
problems that can be described by SNP sentences is called SNP, too. 

Example 1.4.2. The problem CSP((Z; <)) can be described by the following 
SNP sentence. 

BTVx, y,z((x <y^ T(x, y)) 

A((T(x,y)AT(y,z))^T{x,z)) A-.T{x,x)) 

□ 



Example 1.4.3. The Betweenness problem CSP((Z; Betw)) (Example 1.1.31 can 
be described by the following SNP sentence. 

3TV.T, y,z(-.T(x, x) A ((T(x, y) A T(y, z)) => T(x, z)) 

A (Betw(x, y, z) => ((T(x, y) A T(y, z)) V (T(z, y) A T(y, x)))) 

□ 

Example 1.4.4. The problem whether a given undirected graph can be parti- 
tioned into two triangle-free graphs (this problem has been called No-Mono- Tri in 
Example 1.1.8) can be described by the SNP sentence. 

3MVx,y,z (-.(M(x) A M(y) A M(z) A E(x,y) A E{y,z) A E{z,x)) 

A-.(-.M (a;) A -iM (y) A ->M (z) A E(x, y) A E(y, z) A E[z, x))) 



The following fundamental lemma for SNP sentences is due to Feder and Vardi 90 



and can be shown by a simple compactness argument (Theorem 2.3.1). 

Lemma 1.4.5 (from |90|). Let 21 be an infinite structure, and $ an SNP sentence. 



Then 21 |= $ if and only if 21' |= $ for all finite induced substructures 21' o/2l. 

Since every finite induced substructure of 05 homomorphically maps to 03, and 
therefore satisfies $, we have the following consequence. 

Corollary 1.4.6. Let $ be an SNP sentence that describes CSP(*8) for a struc- 
ture 03 . Then 03 itself satisfies $ . 

1.4.3. SNP and CSPs. We say that two SNP sentences $ and \I/ are equivalent 
if for all structures (equivalently: all finite structures) 21 we have 21 |= $ if and only 
if 21 |= \E f . We assume in the following that the first-order part cf> of $ is written in 
conjunctive normal form. 

Definition 1.4.7. Let $ be an SNP sentence whose unquantified relation symbols 
are from the signature r. Then $ is called monotone if each literal o/$ with a symbol 
from t U {=} is negative, that is, of the form ->R(x), for R 6 (r U {=})• 
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In particular, monotone SNP sentences do not contain literals of the form x = y 
(hence, in the terminology of Feder and Vardi 89 1, we work here with monotone 
SNP without inequality] the reason why Feder and Vardi add the attribute without 
inequalities is that for them, SNP sentences are written in negation normal form, so 
forbidding literals of the form x — y amounts to forbidding inequalities in negation 
normal form). 

We also assume that monotone SNP sentences do not contain literals of the form 
x =/= y. This is without loss of generality, since every monotone SNP sentence is 
equivalent to one which does not contain literals of the form x =^ y. To obtain the 
equivalent sentence, we remove literals of the form x ^ y and replace all occurrences 
of y in the same clause by x. Note that the SNP sentences given in Example |1.4.2[ 
|1.4.3| and |1. 4.4] can be easily re- written into equivalent monotone SNP sentences. 

The class of structures that satisfies a given monotone SNP sentences is clearly 



closed under inverse homomorphisms. The converse is a result by Feder and Vardi 90 
it shows that for SNP, the semantic restriction of closure under inverse homomor- 
phisms and the syntactic restriction of monotonicity match. 

Theorem 1.4.8 (from [90]). Let $ be an SNP sentence. Then the class of struc- 
tures that satisfies $ is closed under inverse homomorphisms if and only if $ is 
equivalent to a monotone SNP sentence. 

When ip is a clause of a first-order formula <p m conjunctive normal form, let 
-iipi, ■ ■ ■ , ^ipk be the negative literals in ip, i.e., for all i the formula ipi is of the form 
R(x) for R G r U {Ri, • • • , Rk}- Then the canonical database of ip is the canonical 
database (see Section |1.2| of ipx A • • • A ipk (that is, in this definition the positive 
literals in ip are ignored). We say that ip is connected if the canonical database of 
ip is connected (see Section . We say that an SNP sentence $ is connected if all 
clauses of the first-order part <p of $ are connected. 

Theorem 1.4.9. Let $ be an SNP sentence. Then the class of structures that 
satisfies $ is closed under disjoint unions if and only if $ is equivalent to a connected 
SNP sentence. 

Proof. Let be of the form 3R\, . . . ,Rk- Vxi, ...,X\. <p where cf> is a quantifier- 
free first-order formula over the signature cr = (t U {Ri, . . . , Rk})- 

Suppose first that $ is connected, and that 2li and 2I2 both satisfy $. In other 
words, there is a cr-expansion 21^ of 2li and a er-expansion 2t| of 2l 2 such that those 
expansions satisfy Vxi, ... ,xi.<fi. We claim that the disjoint union 21* of 2lf and 21*, 
also satisfies Vxi, ...,xi.(p; otherwise, there would be a clause ip in <j> and elements 
ax, . . . ,a q of Ai U A% such that ip(ax, ■ ■ • , a q ) is false in 21*. Since 2lj and 2t| satisfy 
Vxi, . . . , xi.ip(xx, ■ • ■ , xi), there must be i, j such that a, € Ax and Oj € A%. But then 
the canonical database for ip is disconnected, a contradiction. 

For the opposite direction of the statement, assume that the class of structures 
that satisfies <E> is closed under disjoint unions. Consider the SNP sentence ^> = 
3Rx, . . . , Rk, E. Vxi, . . . ,xi. ip where ip is the conjunction of the following clauses (we 
assume without loss of generality that I > 3) . 

• For each relation symbol R € r, say of arity p, and each i < j < p, add the 
conjunct ->R(xx, ■ ■ ■ ,x p ) V E(xi,Xj) to ip. 

• Add the conjunct ->E{xx,X2) V ->E(x2, £3) V E(xx, £3) to ip. 

• Add the conjunct ->E(xx, x%) V E{x2,xx) to ip. 

• For each clause <p' of <p with variables y\, ■ ■ ■ ,y q Q {xx, ■ ■ ■ , x{\, add to ip the 



conjunct 



i<j<q 
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We claim that the connected monotone SNP sentence ^> is equivalent to 3". Suppose 
first that 21 is a finite structure that satisfies Then there is a cr-expansion 21' of 
21 that satisfies <fi. The expansion of 21' by the relation E = A 2 shows that 21 also 
satisfies ij}- 

Now suppose that 21 is a finite structure with domain A that satisfies \P. Then 
there is a (a U {^-expansion 21' of 21 that satisfies tp. Write 21' = % l±l • • • W 2tJ for 
connected c-structures 21 ^ , . . . , 21J . Note that the clauses of ip force that the relation 
E denotes Af in the structure 21^, for each i < I, Let 21; be the cr-reduct of 21-. 
Then 21; satisfies cj>, because if there was a clause (/>' from </> violated in 21; then the 
corresponding clause in ip would be violated in 21^. Hence, 2ti (= $ for all i < I, and 
since $ is closed under disjoint unions, we also have that 21 \= $. □ 

Theorem 1 1 . 4 . 8| combined with the previous result shows the following. 

COROLLARY 1.4.10. An SNP sentence <I> describes a problem of the form CSP(03) 
for an infinite structure 03 if and only if $ is equivalent to a monotone and connected 
SNP sentence 

Proof. Suppose first that $ is a monotone SNP sentence with connected clauses. 



To show that $ describes a problem of the form CSP(03) we can use Lemma 1.1.7 
It thus suffices to show that the class of structures that satisfy $ is closed under 
disjoint unions and inverse homomorphisms. But this has already been observed in 
Theorem [TX8] and Theorem [i~L9l 

For the implication in the opposite direction, suppose that $ describes a problem 
of the form CSP(03) for some infinite structure 03. In particular, the class of structures 
that satisfies $ is closed under inverse homomorphisms. By Theorem |1.4.8[ <& is 
equivalent to a monotone SNP sentence. Moreover, the class of structures that satisfies 
$ is closed under disjoint unions, and hence $ is also equivalent to a connected SNP 
sentence. By inspection of the proof of Theorem 1.4.9[ we see that when $ is already 



monotone, then the connected SNP sentence in the proof of Theorem 1.4.9| will also 
be monotone. It follows that $ is also equivalent to a connected monotone SNP 
sentence. □ 

1.4.4. Monadic SNP. When we further restrict monotone SNP by only al- 
lowing unary existentially quantified relations, the corresponding class of problems, 
called montone monadic SNP (or, short, MMSNP), gets very close to finite domain 
constraint satisfaction problems. Indeed, Feder and Vardi showed that the class MM- 
SNP exhibits a complexity dichotomy if and only if the class of all finite domain CSPs 
exhibits a complexity dichotomy (that is, if the dichotomy conjecture mentioned in 
the introduction is true). In one direction, this is obvious since MMSNP obviously 
contains CSP(03) for all finite structures 03 (we may use a unary relation symbol for 
each element of 03). In the other direction, Feder and Vardi showed that every prob- 
lem in MMSNP is equivalent under randomized Turing-reductions to a finite domain 
constraint satisfaction problem. The reduction has subsequently been derandomized 



by Kun 134 



Theorem 1.4.11 (of [89] and 134 ; see 148 for a formalization) . Every problem 
in monotone monadic SNP is polynomial-time Turing equivalent to CSP(03) for a 
finite structure 03. 

Similarly as in the previous section, we might ask for a syntactic characterization 
of those monadic SNP sentences that describe a CSP. Note that this does not directly 
follow from Corollary |1.4.10| since the reductions used there introduce additional 
existentially quantified relations that are not monadic. However, we have the following 
monadic version of Theorem 11.4.81 



1.5. EXAMPLES 



21 



Theorem 1.4.12 (Theorem 3 in [90]). Let $ be a monadic SNP sentence. Then 
the class of structures that satisfies $ is closed under inverse homomorphisms if and 
only if <1> is equivalent to a monotone monadic SNP sentence. 

Moreover, one can show the following monadic version of Proposition |1. 4.9] 

Proposition 1.4.13. Let $ be a monadic SNP sentence. Then the class of struc- 
tures that satisfies $ is closed under disjoint unions if and only if $ is equivalent to 
a connected monadic SNP sentence. 

Proof. Let V be the set of variables of the first-order part of let Pi, . . . , Pk 
be the existential monadic predicates in $, and let r be the input signature so that 
has signature {Pi, . . . , Pk} U r. If $ is connected, then it describes a problem that 
is closed under disjoint unions; this follows from Theorem |1.4.9| 

For the opposite direction, suppose that $ describes a problem that is closed 
under disjoint unions. We can assume without loss of generality that <I> is minimal in 
the sense that if we remove literals from some of the clauses the resulting SNP sentence 
is inequivalent. We shall show that then <E> must be connected. Let us suppose that 
this is not the case, and that there is a clause ip in that is not connected. The 
clause ip can be written as ipi V ip 2 where the set of variables X C V of ipi and the 
set of variables Y C V of ip 2 are non-empty and disjoint. Consider the formulas 
<f>x and <S>y obtained from $ by replacing ip by ipi and ip by ip 2 , respectively. By 
minimality of <E> there is a r-structure 21 1 that satisfies <I> but not $x, and similarly 
there exists a r-structure 2l 2 that satisfies $ but not <&y By assumption, the disjoint 
union 21 of 21 1 and 2l 2 satisfies $. So there exists a r U {Pi, . . . , Pfc {-expansion 21' 
of 21 = 2li W 21 2 that satisfies the first-order part of <&. Consider the substructures 
21 ^ and 2l 2 of 21' induced by Ai and A 2 , respectively. We have that QL[ does not 
satisfy ipi (otherwise 2ti would satisfy $x)- Consequently, there is an assignment 
si : V — > A\ of the universal variables that falsifies tp\. By similar reasoning we 
can infer that there is an assignment s 2 : V — > A 2 that falsifies ip2- Finally, fix any 
assignment s : V — >• A\ U A 2 that coincides with s% over X and with s 2 over Y (such 
an assignment exists because X and Y are disjoint). Clearly, s falsifies ip and 21 does 
not satisfy <5, a contradiction. □ 

Similarly as in Corollary |1. 4. 10| for SNP, we can combine the conditions of closure 
under inverse homomorphisms and closure under disjoint unions, and arrive at the 
following. 

Corollary 1.4.14. A monadic SNP sentence $ describes a problem of the form 
CSP(93) for an infinite structure 58 if and only if $ is equivalent to a connected 
monotone monadic SNP sentence. 

We want to remark that the problems that can be described by connected mono- 
tone monadic SNP sentences are exactly the problems called forbidden patterns prob- 
lems in the sense of Madelaine [147 . Clearly, for every finite 25 the problem CSP(Q5) 



is a forbidden patterns problem. In 150 is has been shown that the problems in 



MMSNP are exactly finite unions of forbidden patterns problems (going back to ideas 
from 



89 



We summarize the landscape of classes of computational problems from this sec- 
tion in Figure |1.6| 

1.5. Examples 

We present computational problems that have been studied in various areas of 
theoretical computer science, and that can be formulated as constraint satisfaction 
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Figure 1.6. Fragments of SNP. 



problems in the sense of Section |1.1| \l.2\ |1.3[ or |1.4| We describe each problem 
from the perspective in which the computational problem has appeared first in the 
literature. 

Our list is by far not exhaustive; computational problems that can be exactly 
formulated as CSP(*B) for an infinite structure 03 are abundant in almost every area 
of theoretical computer science. 

1.5.1. Allen's interval algebra. Allen's interval algebra [H] is a formalism that 
is famous in artificial intelligence, and which has been introduced to reason about 
intervals and about the relationships between intervals. 



Formally, Allen's interval algebra is a proper relation algebra (see Section 1.3.2); 
we can also view it as a structure with a binary relational signature. The domain is 
the set I of all closed intervals [a, 6] of rational numbers, where a, b £ Q, a < b. When 
x — [a,b] is an interval, then —x denotes the interval [—b, —a]. For R C Q 2 , R~ 
denotes the relation {(— x, — y) | (x,y) G R}. Recall that in proper relation algebras, 
R~ denotes the relation {(y,x) | [x,y) G R}. 



Relation Symbol 


Definition 


Explanation 


P 


{([a,b],[c,d]) \b<c} 


[a, b] preceeds [c, d] 


M 


{([a,b],[c,d]) | b = c} 


[a, b] meets [c, d] 


O 


{([a,b], [c,d\) | a < c < b < d} 


[a,b] overlaps with [c,d] 


S 


{([a, b], [c, d]) | a = c and b < d} 


[a, b] starts [c, d] 


D 


{([a,b], [c,d]) | c < a < b < d} 


[a, b] is during [c, d] 


E 


{([a, b], [c, d]) | a — c,b = d} 


[a, b] equals [c, d] 



Figure 1.7. The definitions for the basic relations of Allen's interval algebra. 



The basic relations of Allen's interval algebra are the 13 relations P, M, O, S,D,E 
(defined in Figure 1.7|, P~ ,M~,0~, S~ , and the inverse of S, D, and S~ , denoted by 
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S", , and (S~)^, respectively. Note that those 13 relations are pairwise disjoint, 
and that their union equals I 2 . Recall our convention that when 72. is a subset of the 
basic relations, we write xlZy if (x, y) € U_rg7?.-^- For example, x{P, P~}y signifies 
that the intervals x and y are disjoint. The 2 13 relations that arise in this way will 
be called the relations of Allen's interval algebra. 

An important computational problem for Allen's interval algebra is the network 
satisfaction problem for Allen's interval algebra. This problem can be viewed as 
CSP(2l) where 21 is a structure with domain I and a signature containing 2 13 binary 
relation symbols (see Section 1.3.2 ). More on this structure can be found in Chaptcr|3j 



Example |3.1.11| We are sometimes sloppy and write Allen's interval algebra when we 
mean 21 (rather than A). 

The problem CSP(2l) is NP-complete 8j. The complexity of the CSP for (binary) 



reducts of Allen's interval algebra has been completely classified in 133 



1.5.2. Phylogenetic reconstruction problems. In modern biology it is be- 
lieved that the species in the evolution of life on earth developed in a mostly tree-like 
fashion: at certain time periods, species separated into sub-species. The goal of 
phylogenetic reconstruction is to determine the evolutionary tree from given partial 
information about the tree. This motivates the computational problem of rooted triple 
satisfiability (also called rooted triple consistency), defined below. In 1981, Aho, Sa- 
giv, Szymanski, and Ullman [4] presented a quadratic time algorithm to this problem, 
motivated independently from computational biology by questions in database theory. 

Let T be a tree with vertex set T and with a distinguished vertex r, the root of 
T. For u, v € T, we say that u lies below v if the path from u to r passes through v. 
We say that u lies strictly below v if u lies below v and a/». The youngest common 
ancestor ( yea) of two vertices u, v £ T is the node w such that both u and v lies below 
w and w has maximal distance from r. 

Rooted- Triple Satisfiability 

INSTANCE: A finite set of variables V, and a set of triples xy\z for x,y,z G V. 
QUESTION: Is there a rooted tee 1 with leaves L and a mapping s : V —> L such 
that for every triple xy\z the yea of s(x) and s(y) lies strictly below the yea of s(x) 
and s(z) in T? 

Another famous problem that has been studied in this context is the quartet 



satisfiability problem, which is NP-complete 182 



Quartet Satisfiability 

INSTANCE: A finite set of variables V, and a set of quartets xy.uv with x, y, u, v £ V . 
QUESTION: Is there a tee 1 with leaves L and a mapping s : V —> L such that for 
every quartet xy.uv € R yca the shortest path from x to y is disjoint to the shortest 
path from u to v? 

It is straightforward to check that the class of positive instances (viewed as rela- 
tional structures) of each of those two computational problems is closed under disjoint 



unions and inverse homomorphisms. By Lemma 1.1.7 both the rooted triple satis- 
faction problem and the quartet satisfaction problem can be formulated as CSP (05) 
for an infinite structure *B. We come back to those CSPs in Chapter |4j 



1.5.3. Branching-time constraints. An important model in temporal reason- 
ing is branching time, where for every time point the past is linearly ordered, but the 
future is only partially ordered. 
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FIGURE 1.8. The composition table for the basic relations in the 
left-linear point algebra. 



This motivates the so-called left-linear point algebra 83||lll , which is a relation 



algebra with four basic relations, denoted by =, <, >, and |. Here we imagine 
that l x < y' signifies that x is earlier in time than y, and to the left of y when we 
draw points in the plane, and this motivates the name left linear point algebra. The 
composition operator on those four basic relations is given in Figure |1.8[ The inverse 
of < is >, Id denotes =, and | is its own inverse, and the relation algebra is uniquely 
given by this data. 

As explained in Section |1.3.2| the network consistency problem for the left-linear 
point algebra can be viewed as CSP(23) for an appropriate infinite structure with 
16 = 2 4 binary relations (one for each subset of {=,<,>,]}). In this structure, 
< must denote a dense partial order which is semi-linear, i.e., for every x the set 
{y | y < x} is linearly ordered by by <. An explicit example of such a structure is 
given in Section |4.2| The network consistency problem of the left-linear point algebra 
is polynomial-time equivalent to the following problem, which we call branching-time 
satisfiability problem. 

Branching-Time Satisfiability 

INSTANCE: A finite relational structure 21 = {A;<, ||,^) where <, ||, and ^ are 
binary relations. 

QUESTION: Is there a rooted tree 1 and a mapping s : A — > T such that in 1 the fol- 
lowing is satisfied: a) If (x, y) g < 2t , then s(x) lies above s(y); b) If (x,y) g then 
neither s(x) lies strictly above s(y), nor s(y) strictly above s(x); c) If (x,y) e 
then s(x) ^ s(y). 

The idea why this problem is polynomial-time equivalent to the network satisfac- 
tion problem of the left-linear point algebra is the observation that in any represen- 
tation *B of the left-linear point algebra, the relation x{<, >,=}y has the primitive 
positive definition 

3z. x{<,=}z Ay{<,=}z , 
and the relation x{<, |, =}y has the primitive positive definition 

3z. x{<, —}z A z{\, =}y ; 



we can then use Theorem 1.2.6 (for details, see [41] ). 

The branching-time satisfiability problem can be formulated as CSP(£) for the 
structure with domain C := {0,1}* and relations <, ||, and 7^, where < denotes the 
relation 

{(u, v) € C 2 I u is a prefix of v} . 

The relation ^ is the disequality relation, and u \\ v holds if u and v are equal or 
incomparable with respect to <. Let < denote the intersection of < and ^. Note 
that the structure £ can not be used to obtain a representation of the left-linear point 
algebra, since (<) o (<) does not equal <. 
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FIGURE 1.9. The composition table for the basic relations of Cor- 
nell's tree algebra C. 



The first polynomial-time algorithm for the branching-time consistency problem 
(and therefore also for the network satisfaction problem of the left-linear point alge- 
bra) is due to Hirsch |lll| , and has a worst-case running time in 0(n 5 ). This has 
been improved by Broxvall and Jonsson [54) . who presented an algorithm running in 
0(n 3 376 ) (this algorithm uses an 0(n 2 ' 37b ) algorithm for fast integer matrix multipli- 
cation). A simpler algorithm which docs not use fast matrix multiplication and runs 
in O(nm) has been found in |40|. 



1.5.4. Cornell's tree description constraints. Motivated by problems in 



computational linguistics, Cornell 73 introduced the following computational prob- 
lem^] It is a strictly more expressive problem than the branching time satisfaction 



problem from the previous section, but has been introduced independently from 111 
and [54| . There are many equivalent formulations of this problem. One is as the gen- 
eral network satisfaction problem for the relation algebra C with atoms =, <, >, -<, 
and y which is given by the composition table in Figure |1.9| 

There is a natural interpretation of the finite representations of Cornell's tree 
algebra C which we describe in the following. Let 1 be a rooted tree. We define the 
binary relations < z and on the vertices V of 1 as follows. When u, v g V, then 
u < v if u is a proper ancestor of v. To define -< , we start a depth-first search at 
the root of T, and order the vertices according to the time they were visited first by 
the depth-first search. For u, v € V, define u -<~ v if u and v are incomparable with 
respect to < % , and if u is visited before v during the depth-first search. 

Proposition 1.5.1. For every finite representation (D,i) of Cornell's tree algebra 
C, there exists a rooted tree *T with vertices D U {r} such that r denotes the root of 
H, i(<) = < , and i{~<) = -< 5 '. Conversely, for every rooted tree X with vertices V 
there exists a representation (V,j) of C such that j(<) = < , and i(-<) = -< . 



Similarly as in Section 1.5.3 all 2 5 relations of C can be obtained by repeated 
compositions and intersections of the four relations {<,=}, {-<,=}, {-<, !^,=}, and 
{<,>,^,^}; for details, see |41| . The algorithm presented for the general network 
satisfaction problem for C in [73| is not complete. A polynomial-time algorithm has 
been found in [41] , 

1.5.5. Set constraints. Many fundamental problems in artificial intelligence, 
knowledge representation, and verification involve reasoning about sets and relations 
between sets and can be modeled as constraint satisfaction problems. One of the most 
fundamental problems of this type is the following. We denote the set of all subsets 
of N by V(N). 

^1 feel personally committed to Cornell's problem since it was the first CSP with an u-categorical 
template I met. 
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Basic Set Constraint Satisfiability 

INSTANCE: A finite set of variables V, and a set <j> of constraints of the form x C y, 
x 1 1 y, or x 7^ y, for x, y € 

QUESTION: Is there a mapping s : V -> 7>(N) such that 

a) If a; C y is in (/>, then s(x) is contained in s(y); 

b) If x || y is in <j>, then s(x) and s(y) are disjoint sets; 

c) If x 7^ y is in <f), then s(x) and s(y) are distinct sets. 

This problem has the shorter description CSP((7 : '(N); C, ||, =^)) where C, ||, ^ are 
binary relations over P(N), standing for the binary relations containment, disjointness, 
and inequality between sets. Drakengren and Jonsson [81| showed that basic set 
constraint satisfiability can be decided in polynomial time. They also showed that the 
generalization of the problem can be solved in polynomial time where each constraint 
has the form 

%i 7^ Vi V • • • V Xfc ^ y k V x Ryo 

where R is either C, ||, or 7^, and where Xo, . . . , Xk, yo, ■ ■ ■ , yk are not necessarily 
distinct variables. 



1.5.6. Spatial reasoning. Qualitative spatial reasoning (QSR) is concerned 
with representation formalisms that are considered close to conceptual schemata used 
by humans for reasoning about their physical environment — in particular, about pro- 
cesses or events and about the spatial environment in which they are situated. The 
approach in qualitative reasoning is to develop relational schemas that abstract from 
concrete metrical data of entities (for example time points, coordinate positions, or 
distances) by subsuming similar metric or topological configurations of entities into 
one qualitative representation. 

There are many formalisms for qualitative spatial reasoning. In particular, sev- 
eral relation algebras (see Section 1.3.2) have been studied in this context. A basic 
example is the RCC5 relation algebra (with 5 atoms; the RCC5 relation algebra is 
also known under the name containment algebra |19||83] ), and the RCC8 relation al- 
gebra (with 8 atoms). In both formalisms, the variables denote 'non-empty regions'. 
In RCC5, the five atoms are denoted by DR, PO, PP, PPI, EQ, and they stand 
for disjointness, proper overlap, proper containment (proper-part- or), its inverse, and 
equality, respectively. In RCC8, we further distinguish how the 'boundaries' of two 



regions relate to each other. We do not further discuss RCC8, for details, see 50 83 



There are many equivalent ways to formally define RCC5. Often, this is done 
by specifying the composition table for atomic relations, but we find this tedious. 
Here, we rather define RCC5 as the proper relation algebra whose domain are all 
open (or all closed) disks in K 2 , and where the basic relations are disjointness (empty 
intersection), proper overlap, containment, the inverse of containment, and equality 
of disks. Then RCC5 is the abstract relation algebra of the proper relation algebra 
of closed disks (see Section 2.1.5 in 83 ). 

The network satisfaction problem for RCC5 is NP-complete; the computational 
complexity of the CSP for the (binary) reducts of *B has been classified in 121||172 



A polynomial-time tractable case of particular interest is the network satisfaction 
problem for the basic relations of RCC5 |172| , i.e., the network satisfaction problem 
for RCC5 when the input is restricted to networks N = (V; /) where / maps to 1 or 
the atoms in RCC5 only. 

In any representation of RCC5, the atomic relations satisfy the following set of 
axioms T. We use P{x,y) as a shortcut for PP(x,y) V EQ(x,y). 
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T := { Vx,y,z (DC(x,y) A P(z,y) ^ DC{x,z)) 

Vx, y, z {PO(x, y) A P(y, z) -> (PO(ac, z) V PP(.x, z))) 

Vx, y, z (PP(i, y) A PP(y, z) PP{x, z)) 

Vx, y, z (P- 1 (x, y) A P(y, z) -> nDC(i, y)) } 

It is easy to see that the network satisfaction problem for the basic relations of 
RCC5 is essentially the same problem as CSP(T), where T is the first-order theory 
defined as above. It can be checked easily that T satisfies item (2) in the statement 



of Proposition 1.3.6 and hence there exists an infinite structure 23 such that CSP(23) 
equals the satisfiability problem for RCC5. We will give more explicit descriptions of 
such an infinite structure 23 in Chapter E] (and it turns out that there are close links 



with the problem from Section 1.5.5). 



1.5.7. Horn-SAT. The following problem is an important P-complete prob- 



lem 62 . It can be solved in linear time in the size of the input 79 



Horn-SAT 

INSTANCE: A propositional formula in conjunctive normal form (CNF) with at most 
one positive literal per clause. 

QUESTION: Is there a Boolean assignment for the variables such that in each clause 
at least one literal is true? 

We cannot model this problem as CSP(23) for a finite signature structure; how- 
ever, note that a clause ->Xi V ■ ■ ■ V ->Xfc V Xq is equivalent to 

3yi, . . .,yk-i ((->xi V ^x 2 V yi) A V ^x 3 V y 2 ) A • • • A (^y fc _i V ^x k V x )) . 

Hence, by introducing new variables, there is a straightforward reduction of Horn- 
SAT to the restriction of Horn-SAT where every clause has at most three literals. 
This restricted problem, which we call Horn-3SAT, can be formulated as CSP(23) for 
23 = ({0,l};{( 2 ;,y,z) | (xAy)^z}). 

1.5.8. Precedence constraints in scheduling. The following problem has 



been studied in 157 in scheduling: given is a finite set of variables V, and a finite 



set of and/or precedence constraints, i.e., constraints of the form 

Xq > X\ V ■ ■ ■ V Xq > Xk (6) 

for xq,Xi, . . . ,Xff € V. The question is whether there exists an assignment V — > Q 
(equivalently, we can replace Q by Z, or any other infinite linearly ordered set). 

As in the case of Horn-SAT, we cannot directly model this problem as CSP(25) 
for a finite signature structure 23. However, note that Formula ^ is equivalent to 

• • -,yk-x( (xo > x\ V x > yi)A 

(yi > x 2 V yi > y 2 ) A ■ • • A (y fe _i > x k _ x V y fe _i > x k )) . 

This shows that and/or precedence constraints can be translated into conjunctions 
of constraints of the form xq > x\ V xq > x 2 by introducing new existentially quan- 
tified variables. Hence, the problem whether a given set of and/or precedence con- 
straints is satisfiable reduces naturally to CSP((Q; P mm )) where P mm is the ternary 
relation {(a, b, c) | a > 6 V a > c}. Note that P mm holds on exactly those triples 
(a, b, c) where a is larger than the minimum of b and c. The problem CSP((Q; P min )) 



can be solved in polynomial time; this is essentially due to 157 . For more expressive 
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constraint languages over Q that contain the relation R mm and whose CSP can still 
be solved in polynomial time, see Section fl0.5.2| or Section flO. 4. 1| 

1.5.9. Ord-Horn constraints. In this section we work with first-order formulas 
over the signature {<}. We write x < y as a shortcut for (x < y) V (x — y) (recall 
our convention that equality is part of first-order logic). A formula over the signature 
{<} and with variables V is called Ord-Horn if it is a conjunction of disjunctions of 
the form 

(xi = yx) V • • • V (x k = y k ) V {x Ry ) 
where Xq, x\, . . . , x k , yo, yi, ■ ■ ■ , y k € V, and R is either <, <, or =. 

Ord-Horn Satisfiability 

INSTANCE: A finite set of variables V, and a finite set of Ord-Horn formulas with 
variables from V. 

QUESTION: Is there an assignment V — > Q that satisfies all the given formulas over 

(Q;<)? 



Nebel and Biirckert 158 showed that Ord-Horn Satisfiability can be solved in 
polynomial time. A relation R C Q k is called Ord-Horn if it is definable by an Ord- 
Horn formula over (Q;<). As in the case of Horn-SAT and of and/or precedence 
constraints, there are structures 03 with finitely many Ord-Horn relations such that 
all Ord-Horn relations have a primitive positive definition in 03. It can be shown that 
the following structure has this property (see Chapter [6| . 

(Q; <, A {(x, y, u, v)\(x = y) ^{u = v)}) 



In Section 10.4.1 we see that constraint languages that contain and/or precedence 



constraints and Ord-Horn constraints can still be solved in polynomial time. 

1.5.10. Ord-Horn interval constraints. For some (binary) reducts 03 of Allen's 
interval algebra the problem CSP(03) can be solved in polynomial time. The most 
important of those reducts is the class of Ord-Horn interval constraints, which has 
been introduced by Nebel and Burkert [158 . It consists of all those relations TZ of 
Allen's interval algebra such that the relation {(x, y, u, v) \ ([x, y], [u, v]) <G 7^} is Ord- 



Horn (see Section 1.5.9). Now it is not hard to see that satisfiability for Ord-Horn 



interval constraints has a polynomial-time reduction to Ord-Horn satisfiability. This 



type of reduction will be studied in Section 5.5 



1.5.11. Linear program feasibility. Linear Programming is a computational 



problem of outstanding theoretical and practical importance (see e.g. 178 ). It is 
known to be computationally equivalent to the problem to decide whether a given set 
of linear (non-strict) inequalities is feasible, i.e., defines a non-empty set. 

Linear Program Feasibility 

INSTANCE: A finite set of variables V; a finite set of linear inequalities of the form 
d\Xi + ■ ■ • + afeXj. < a where X\, . . . ,Xk € V and oq, . . . , a k are rational numbers 
where numerator and denominator are represented in binary. 
QUESTION: Does there exist an x € M' v ' that satisfies all inequalities? 



Kachyian showed in 128 that Linear Program Feasibility can be solved in poly- 
nomial time. It is clearly not possible to formulate this problem as CSP (03) for a 
structure 03 with a finite relational signature. However, we show below that it is 
polynomial-time equivalent to CSP ((R; {(x, y, z) \ x + y = z}, {1}, <)). For this, we 
need the following lemma. 
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Lemma 1.5.2 (from [35] ). Let n , . . . , n.; € Q be arbitrary rational numbers. Then 
the relation {(x%, . . . ,xi) \ n\X\ + . . . + nixi = uq} is primitive positive definable in 
(R; {(x, y, z) \ x + y = z},{l}). Furthermore, the primitive positive formula that 
defines the relation can be computed in polynomial time. 

The idea to prove this is to use iterated doubling to define large numbers with 
small primitive positive formulas. By extending the previous result to inequalities, 
one can prove the following. 

PROPOSITION 1.5.3 (from |35| ). The linear program feasibility problem for linear 
programs is polynomial-time equivalent to CSP((IR; {(x, y, z) \x + y = z},{1},<)). 

1.5.12. The max-atoms problem. In our list of problems from the literature 
that can be formulated as CSP(Q3), we also want to include one problem in NP where 
it is not known whether CSP(Q3) is in P or NP-hard. Our problem is closely related to 



the following problem, which has been introduced in 21 and, independently, in 157 



The Max- Atoms Problem 

INSTANCE: A finite set of variables V; a finite set of constraints of the form xq < 
max(aixi, . . . , auXk) where x\, . . . , Xk G V and do, • • • ,£Jfc (the coefficients) are inte- 
gers represented in binary. 

QUESTION: Does there exist an x S Q' y ' that satisfies all inequalities? 

It is known that the Max-atoms problem is computationally equivalent to mean- 



payoff games 157 , and therefore it is contained in NP n coNP. It also follows that 
deciding the winner in Parity games and satisfiability of the propositional /i-calculus 
can be reduced to the max-atoms problem. Bezem, Nieuwenhuis and Rodrfguez- 



Carbonell 21 give an alternative proof that the problem is in NP n coNP. In the 
same paper, they also shown that a certain hypergraph reachability problem, and 
an intensively studied problem in max/+ algebra are equivalent to the max-atoms 
problem. Moreover, they show that the problem is in P when the coefficients in the 
input are represented in unary. 

Similarly as for linear program feasibility, the Max-atoms problem cannot be 
formulated as CSP(*8) for a structure 03 with a finite relational signature. The 
problem we introduce instead is 

CSP((Q; {(x, y)\y = x + 1}, {(x, y)\y = 2x}, R mm )) 



where R mm has been introduced in Section 1.5.8 The max-atoms problem can be 
reduced to this problem: we replace expressions of the form Xi + ai by a new variable 
t/i, and add a primitive positive formula 4>(x,y) that defines yi — Xi + at and can be 
computed in polynomial time in the input size of the max-atoms problem. We do not 
know how to prove hardness for the CSP above, and rather think that the problem 
might well be in P. 

1.5.13. Unification. Unification (and unification modulo equational theories) 
is an proper field in computational logic, and the complexity of the unification problem 
has been studied in numerous variants |10| . Many unification problems can be viewed 
as CSP(25), for an appropriate infinite structure 03, as we will see in the following. 
We start with the most fundamental unification problem. 

Let a := {/i, . . . , /&} be a finite set function symbols, and let a; be a variable 
symbol. Then F(x) denotes the set of all terms that can be constructed from r and 
the variable x. The unnested unification problem over t is the following problem^ 

^This problem is known to be equivalent to the standard unification problem where the input 
is a single equation ti ?s for 'nested' terms t±,t2 S T{x). 
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Unnested Unification Problem over r 

INSTANCE: a finite set of variables V, and a finite set of 'un-nested' term equations, 
i.e., expressions of the form y a « /(yi, . . . , y k ) for y , yx, . . . , yk € V and / Sr. 
QUESTION: is there an assignment s : V — > -F(x) such that for every expression 
Da = f(vi, ■ ■ • > Vk) in the input we have s(y ) = f{s(yx), . . . , s(y k ))7 



For fixed r as above, let T = (.F(x); Fx, . . . , F^) be the structure where F{ is the 
relation {(t ,tx, ■ ■ ■ ,t r ) € (T(x)) r+1 \ to = fi(tx, ■ ■ ■ ,t r )} (here, r is the arity of /,). 
It is clear that the unnested unification problem over r can be described as CSP(T). 
In a similar way, equational unification problems (see |10| ) can be viewed as CSPs. 

1.6. Overview 

This text develops the universal-algebraic approach for ocomplexity analysis of 
constraint satisfaction problems with countably infinite uj -categorical templates. Parts 
of the corresponding theory follow or generalize the universal-algebraic approach for 
CSPs with finite templates, whereas other parts are specific to infinite domains, such 
as the way in which we apply Ramsey theory. We then present two complexity 
classification results that have been obtained using this approach: the classification 
of temporal CSPs in Chapter [TUJ and Schaefer's theorem for graphs in Chapter [9] 
We close with Chapter [TT] on classes of computational problems that probably do not 
exhibit a complexity dichotomy. 



Publication Note. My PhD-thesis 22] also treated constraint satisfaction with 
w-categorical templates, and already hinted at the relevance of polymorphisms and 
universal algebra. But it is only here that we fully present the universal-algebraic ap- 
proach and its applications for classification projects of large classes of infinite-domain 
constraint satisfaction problems. The self-contained presentation in this thesis is col- 
lecting and re-combining results that have been fragmented over various publications, 
and often come with new proofs. 

Parts of the content of this thesis have been published by co-authors and myself 
in conferences or journals. The example sections in Chapter [T] and Chapter [4] present 

Chapter [3] contains a new proof, to be 



CSPs studied in 23 29 32 34 



41 



42 



50 



published in the journal version of [33] , of the main result in [23] . Chapter [5] covers 



original results from 44 , 25 , and the survey [24] . Many results in Chapter [8] are 



from [49| and the survey 47 



The classification for equality constraint satisfaction problems has first been ob- 
tained in [37] , but the proof presented here is new and borrows results and ideas 
from [27| and [46] . The classification for temporal constraint satisfaction problems 

with some additions from the survey [47| 



in Chapter 10 is based on 38 and 39 



Schaefer's theorem for graphs is based on 48 and 46 , again with additions from 47 



Finally, Chapter 11 also contains some results from 31 



1.7. Uncovered Topics 

When choosing the material to be included in this thesis, certain restrictive choices 
had to be made. We comment on related lines of research or facets of the area that 
we had to skip. 

1.7.1. Infinite Signatures. Several natural computational problems could be 
formulated in the form CSP(93) when we would allow that the structure 05 has a count- 
ably infinite signature. For example, we might want to view the feasibility problem 



for linear programs (Section 1.5.11) as CSP(*B) where 03 contains all relations of the 
form {(xi, . . . , Xk) | a\Xx + • • • + a,k x k a o}j for all rational numbers ao, ax, ■ ■ ■ , afc- 
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Indeed, several general results for constraint satisfaction that we present in this thesis 
would carry over to infinite signatures with no problems. 

If we wanted to extend the present definition of CSP (58) to structures 58 with an 
infinite signature, we are faced with the difficulty to specify how the constraints in 
input instances of CSP (53) are represented. When 58 has a finite signature, this causes 
no problems, since we can fix any representation for the finite number of relation 
symbols; since 58 is considered to be fixed, the precise choice of the representation 
is irrelevant. When 58 has an infinite signature, a good choice how to code the 
constraints in the input very much depends on the structure 58. In the example of 
linear programming feasibility, for instance, we might want to represent the constraint 
a%Xi + • • • + a,kXk < ciq by specifying the coefficients ao, a\, . . . , at- in binary. 

Note that the issue of finite versus infinite constraint languages is not specific 
to infinite domains, but becomes relevant already for finite domains. Typically, for 
infinite constraint languages over a finite domain each constraint in the input is rep- 
resented by listing all tuples of the corresponding relation in the constraint language. 
But this is not the only, and sometimes not even the most natural way to represent the 



constraints. For instance for the Horn-SAT problem (see Section 1.5. 7), the most nat- 
ural way to present the constraint is by writing them as conjunctions of Horn-clauses. 
In the general setting, several representations have been proposed, some of which are 



more concise than listing all tuples 69 , and some of which are less concise 154 . 

It turns out that typically when a constraint satisfaction problem with an infinite 
constraint language is computationally hard, then there is a finite set of relations in 
this language such that the CSP for this sub-language is already NP-hard. For infinite 
constraint languages over a finite domain, and when the constraints are represented 
by explicitly listing all satisfying assignments to the variables of the constraint, it 



has even been conjectured 60 that this might be true in general; that is, when 
CSP(58) is NP-hard under this representation, then 58 has a finite signature reduct 
with a hard CSP. This conjecture is still open. We also want to mention a conditional 



non-dichotomy result for infinite constraint languages from 31 . 

We have decided to keep in this thesis the focus on CSPs for finite constraint 
languages, for the following reasons. This allows to work with one and the same 
definition of the computational problem CSP(58) for all infinite structures 58 with a 
finite signature. Moreover, for all of the algorithms presented in this thesis it will be 
immediately clear under which input assumption they might be generalized to deal 
with an infinite constraint language. This does not prevent us from stating relevant 
mathematical facts in full generality when they also hold for structures with an infinite 
signature; only when it comes to statements about CSP(58) we insist that 58 has finite 
relational signature. 

1.7.2. Complexity classes below P. Besides the mentioned progress on the 
dichotomy conjecture for finite domain CSPs, there has been considerable research 
activity to localize the exact complexity of CSPs inside the complexity class P, or 
with respect to definability in certain logics. By definability of CSP(58) we mean that 
there exists a sentence <E> is some logic (typically extensions of first-order logic and 
restrictions of least fixed point logics) such that 21 |= $ if and only if 21 homomorphi- 
cally maps to 58 (that is, in this case it is most natural to consider the definitions of 
the CSP presented in Section [TTT] and in Section 1.4). 



One motivation for studying computational complexity within P is the question 
whether it is possible to solve problems faster in parallel models of computation. 
Another motivation, in particular for definability of CSPs in certain logics, is the goal 
to better understand the scope of existing algorithmic techniques to solve CSPs (such 
as Datalog, or restrictions of Datalog) . 
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In this line of research, the computational complexity of CSP (03) has been com- 
pletely classified when 03 is a two-element structure [6j. Each problem in this class 
is complete for one of the complexity classes NP, P, Q>L, NL, L, and AC under 
AC isomorphisms. For general finite domains, several universal-algebraic conditions 
are known that imply hardness for various complexity classes (9l 138 . Concerning 
definability of CSPs, there are precise characterizations of those CSPs that are de- 



finable by a first-order sentence 8 137 173 . Moreover, if CSP(03) is not first-order 



definable, then it is L-complete under AC u -reductions 138 (also see 77 87 ). For 



infinite domain constraint satisfaction, it appears that there are no general results 
about localizing the complexity of CSPs within the complexity class P yet. However, 
we would like to remark that already in some concrete and model-theoretically well- 
behaved structures 03 the precise complexity of CSP(03) within P is open. We give 
one example. 

Example 1.7.1. Consider the problem 

CSP((Q; {(x, y, z) | (x = y => y < z) A x < y)) . 

This problem is NL-hard since there is an easy reduction from directed reachability 
to this problem, and directed reachability is an NL-complete problem. By careful 



inspection of Tarjan's linear-time algorithm for strongly connected components 184 
we see that the problem can be solved in linear time. However, the precise complexity 
of this problem is not known; it might be that the problem is contained in NL, but it 
might also be P-hard. 

1.7.3. Quantified CSPs. Let 03 be a structure with a finite relational signature. 
Then the quantified constraint satisfaction problem for 03, denoted QCSP(03), is the 
computational problem to decide for a given first-order sentence (f> in prenex normal 
form and without disjunction and negation symbols whether <j> is true in 03. The 
difference of QCSP(03) from CSP(03) as we have presented it in Section [Pj is that 
universal quantification is permitted in the input sentences (f>. 

The additional expressiveness often comes at the prize of higher computational 
complexity; whereas for finite structures 03, the CSP for 03 is always in NP, there 
are finite structures 03 where QCSP(03) is PSPACE-complete. But quite surpris- 
ingly, several constraint languages with a polynomial-time tractable CSP also have a 
polynomial-time QCSP. This is for instance the case for 2SAT [7] (see Example |lT2| , 



or for Horn-3SAT 124| (see Section 1.5.7). Similarly, it can be shown that the tem- 



poral constraint languages presented in Section [10. 5 . 2 1 and SectionflO.5.4 are not only 



tractable for the CSP, but also for the QCSP. These are attractive results, since they 
assert that we can solve an even more expressive computational problem than the 
CSP for the same constraint language without loosing polynomial-time tractability. 
From a methodological point of view, we remark that the universal-algebraic approach 
can also be applied to study the complexity of the QCSP [52] ; as in the case of the 
CSP, the computational complexity of QCSP (03) is captured by the polymorphisms 
of 03 (see Chapter [5]). Classifications of the QCSP typically rely on the corresponding 
classification for the CSP. In particular, any hardness result for the CSP immediately 
translates into a hardness result for the QCSP. Moreover, in the cases where the 
CSP (03) is tractable, the algorithmic insight is often the starting point for further 
investigations of QCSP(03). 

However, complexity classifications for QCSPs are typically harder to obtain than 
the corresponding complexity classifications for CSPs. One of the reasons is that 
several relevant universal-algebraic facts require the assumption that the algebra be 



idempotent (see Section 5.6.11. The complexity of the QCSP, however, is not pre- 



served by homomorphic equivalence, and when we study QCSP(03) we can thus not 
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pass to the core of 25. Hence, we can in general not make the assumption that the 
polymorphism clone of 25 is idempotent. 

For CSP(25), a powerful way of proving NP-hardness is to give a primitive positive 



interpretation (see Section 5.5) of a Boolean template with a hard CSP. This is no 
longer possible for the QCSP. There are for example 3-element templates 25 that 
are preserved by a semi-lattice operation (and hence no hard Boolean CSP can be 
interpreted in 25) where QCSP(25) is PSPACE-complete [52] . Finally, we would like 
to mention that PSPACE-hardness proofs for the QCSP are often much harder than 
NP-hardness proofs for the CSP (26 
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From the above is not surprising that a full classification of the QCSP complex- 
ity for three-element structures is still open. Similarly, there is no classification of 
the QCSP for the class of temporal constraint languages presented in Chapter [10] 
There are concrete temporal constraint languages where the QCSP is of unknown 
computational complexity, for instance the QCSP for 

(Q;{(x,y,z) \x = y^y>z}) . 

For this problem, we do not know hardness for any complexity class above P, and do 
not know containment in any complexity class below PSPACE. 

1.7.4. Non w-categorical templates. Most methods presented in this thesis 
crucially rely on the assumption that the constraint languages are w-categorical. How- 
ever, systematic complexity classification is also possible for large classes of constraint 
languages that are not w-categorical. 

1.7.4.1. Distance CSPs. The structure (Z; succ) of the integers with the sucessor 
relation succ = {(x,y) \ x = y + 1} constitutes one of the simplest infinite structures 
with a finite signature that is not w-categorical. Structures with a first-order definition 
in (Z; succ) are particularly well-behaved from a model-theoretic perspective: they 



are strongly minimal 114| 151 , and therefore uncountably categorical (but usually 
not cj-categorical) . In 30 , the complexity of CSP(25) has been studied when 25 is 
first-order definable in (Z;succ). As an example, consider the directed graph with 
vertex set Z which has an edge between x and y if the difference between x and 
y is either 1 or 3. This graph can be viewed as the structure (Z;i?{ 13 }) where 
^{1.3} — {{ x iV) \x — yE {1,3}}, which has a first-order definition over (Z; succ) since 
R{i,3}(x,y) iff 

succ(x, y) V 3u, v (succ{x, u) A succ{u, v) A succ(v, y)) . 

Another example is the undirected graph with vertex set Z where two integers x, y 
are linked if the distance between x and y is one or two. 

The corresponding class of CSPs contains many natural combinatorial problems. 
For instance, the CSP for the structure (Z;i?{ l!3 }) is the computational problem to 
label the vertices of a given directed graph G such that if (x,y) is an arc in G, then 
the difference between the label for x and the label for y is one or three. It follows 
from the results in |30| that this problem is in P. The CSP for the undirected graph 
(Z; {(x,y) | \x — y\ € {1,2}}) mentioned above is exactly the 3-coloring problem, 
and therefore NP-complete. In general, those problems have the flavor of assignment 
problems where the task is to map the variables to integers such that various given 
constraints on differences and distances (and Boolean combinations thereof) are sat- 
isfied. Therefore, CSPs whose template is definable over (Z; succ) are called distance 
CSPs. 

The complexity classification for distance CSPs presented in [30] is incomplete in 
two respects. First, it might be the case that a structure 25 with a first-order definition 
in (Z; succ) has a finite core 21. Those cores have the property that they have a single 
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orbit (see Section 1.2.3); this follows easily from the fact that the structure (Z; succ) 
also has only one orbit. But the dichotomy conjecture for finite domain CSPs has 
not yet been established for the particular case of finite templates that have only 
one orbit. The second point in which the complexity classification for distance CSPs 
given in 30 is incomplete is that it only studies templates 58 that are locally finite, 
an additional finiteness condition, defined as follows. A graph is called locally finite 
if every vertex is contained in a finite number of edges; a relational structure is called 
locally finite if its Gaifman graph (definition given in Section [2]) is locally finite. The 



method that is applied in 30 is relational, and not universal- algebraic. 



1.7.4.2. Tractable Expansions of Linear Program Feasibility. Linear Program- 
ming is a computational problem of outstanding theoretical and practical importance. 
As we have seen, it is computationally equivalent to the CSP we have presented in Sec- 
It seems to be interesting to investigate how far the constraint language of 



1.5.11 



tion 

all linear inequalities can be expanded such that the corresponding constraint satisfac- 
tion problem remains polynomial-time solvable. It has been shown in 35 that every 
first-order expansion of linear programming is contained in a class called Horn-DLR 
from 
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and polynomial-time tractable, or otherwise NP-hard. 
An important class of relations over the real numbers that generalizes the class 
of relations defined by linear inequalities is the class of all semi- algebraic relations, 
i.e., relations with a first-order definition in (K; +, *). By the fundamental theorem of 
Tarski and Seidenberg it is known that a relation S C M. n is semi-algebraic if and only 
if it has a quantifier-free first-order definition in (R;+,*,0, 1, <). Geometrically, we 
can view semi-algebraic sets as unions of intersections of the solution sets of strict and 
non-strict polynomial inequalities. The classification of the computational complexity 
of CSPs for real-valued semi-algebraic constraint languages is an ambitious research 
project, and has important links to semidefinite programming: every semidefinite rep- 
resentable set is semi-algebraic and convex. Surprisingly, there are many fundamental 
questions in this area that are wide open, for instance the complexity of semi-linear 

170| ), or the conjecture 



programming feasibility (see e.g. Section 6.4.4 in 190 



or 



that all convex semi-algebraic set are semidefinite representable |109 , i.e., primitive 



positive definable over the structure that has as its relations all the solution spaces of 
semi-definite programs. These important questions from real algebraic geometry are 
out of the scope of this thesis. 



CHAPTER 2 



Preliminaries in Logic 




This chapter collects some basic terminology and facts from logic. The notation 
mostly follows Hodges' text book 114 , so many readers may safely skip this chapter. 



2.1. Structures 

In Section |1.1| we have already defined relational structures; we now give the 
general definition of structures that might also contain functions, since we need those 
later. One occasion where we need functions rather than relations is in Chapter [5] 
when we consider algebras (by which we mean structures with a purely functional 
signature) that arise from the set of polymorphisms of a structure. Functions often 
appear naturally even when one is ultimately interested in relational signatures, see 
e.g. Proposition |7.4.5 Moreover, several templates are most naturally defined over 
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a structure having a functional signature, see e.g. Section |4.3| Most definitions go 
parallel for functional and relational signatures, so we give them together in this 
section. 

A signature r is a set of relation and function symbols, each equipped with an 
arity. A r -structure 21 is a set A (the domain of 21) together with a relation i? a C A k 
for each fc-ary relation symbol in r and a function / : A k — > A for each fc-ary function 
symbol in r; here we allow the case k = to model constant symbols. Unless stated 
otherwise, A, B,C, . . . denote the domains of the structures 21, 93, C, ... , respectively. 
We sometimes write (A; Rf, Rf, . . . , /f, • • • ) for the relational structure 21 with 
relations Rf, Rf, . . . and functions /f , /f, . . . When there is no danger of confusion, 
we use the same symbol for a function and its function symbol, and for a relation and 
its relation symbol. We say that a structure is infinite if its domain is infinite. The 
most important special cases of structures that appear in this thesis are relational 
structures, that is, structures with a purely relational signature, and algebras, that is, 
structures with a purely functional signature. Algebras with domain A, B, C, . . . are 
typically denoted by A, B, C, . . . 

Example 2.1.1. A group is an algebra G with a binary function symbol ■ for 
composition, a unary function symbol _1 for the inverse, and a constant e for the 
identity element of G, satisfying the sentences Vx, y, z. x -(y-z) — (x-y)-z,\/x.x-x~ 1 = 
e, Vx. e • x = e, and Vx. x ■ e = e. In this signature, the subgroups of G are precisely 
the subalgebras of G as defined below. We typically omit the function symbol • and 
write fg for the product of elements /, g of G. Such groups will also be called abstract 
groups to distinguish them from permutation groups; permutation groups are sets of 
permutations of a set X closed under composition and inverse. 

When working with function symbols, it is sometimes convenient to work with 
multi-sorted structures, where we have distinguished unary predicates, called sorts, 
that define a partition of the domain, and where function symbols might only be 
defined on some of the sorts (that is, the function symbols might not be defined on 
some of the elements). We are sloppy with the formal details since they can always 
be worked out easily. In all our applications, the multi-sorted structures will in fact 
be two-sorted, in which case we denote them by (21, 93) — here one sort induces the 
structure 21, and the other sort induces the structure 93. 

2.1.1. Expansions and reducts. Let a, r be signatures with a C r. When 
21 is a er-structure and 93 is a r-structure, both with the same domain, such that 
i? a = i?* 8 for all relations R £ a and / = for all functions and constants / G er, 
then 21 is called a reduct of 93, and 93 is called an expansion of 21. An expansion 93 
of 21 is called first- order if all new relations in 93 are first-order definable over 21. A 
structure 21 is called a finite reduct of 93 if 21 is a reduct of 93 with a finite signature. 
We also write (21, R) (and, similarly, (21, /)) for the expansion of 21 by a new relation 
R (a new function or constant /, respectively). 

If 21 is a r-structure and (ai)i^i a sequence of elements of A indexed by /, then 
(21; (ai)ig/) is the natural (r U {ci\i € 7})-expansion of 21 with \I\ new constants, 
where Ci is interpreted by ai for all i € /. 

2.1.2. Extensions and substructures. A r-structure 21 is a substructure of a 
r-structure 93 iff 

• A C B, 

• for each R £ r, and for all tuples a from A, a € iZ a iff a £ R , and 

• for each / £ r we have that / a (a) := / B (a) is contained in A. 
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In this case, we also say that 05 is an extension of 21. Substructures 21 of 05 and 
expansions 05 of 21 are called proper if the domains of 21 and 03 are distinct. Note 
that for every subset S of the domain of 05 there is a unique smallest substructure 
of 05 whose domain contains S, which is called the substructure of 05 generated by S, 
and which is denoted by 03 [S] . We say that 05 is finitely generated if 05 = 05 [S] for 
a finite set S of elements. A subalgebra of an algebra B (induced by S) is simply a 
substructure of B (generated by S) - recall that we have defined algebras as functions 
with a purely functional signature. 

The following is a concept that we only define for relational structures. 

Definition 2.1.2. The Gaif man- graph of a relational structure 05 with domain 
B is the following undirected graph: the vertex set is B, and there is an edge between 
distinct elements x,y € B when there is a tuple in one of the relations of 05 that has 
both x and y as entries. 

A relational structure 05 is readily seen to be connected (in the sense of Sec- 
tion |1.1[ ) if and only if its Gaifman graph is connected (in the usual graph-theoretic 
sense) . 

2.1.3. Products. Let 21 and 03 be two structures with domain A and B, and 
the same signature r. Then the (direct, or categorical) product € = 21 x 05 is the 
r-structure with domain A x B, which has for each fc-ary R £ r the relation that 
contains a tuple ({at, bt), . . . , (a k , b k )) if an d orn y if ^( a i>---i a fc) holds in 21 and 
R(bt, . . . ,bk) holds in 03. For each fc-ary / G r the structure £ has the operation 
that maps ((oi, . . . , (a k , b k )) to (/(oi, . . . , a k ), f(bt, . . . , b k )). The direct product 
21 x 21 is also denoted by 2l 2 , and the fc-fold product 21 x • • • x 21, defined analogously, 
by 2l fc . 

Remark 2.1.3. For all structures 03i,032 with finite relational signature, the 
following holds. 

CSPOBi) U CSP(03 2 ) = CSP(03i W 05 2 ) 
CSP(03i) n CSP(03 2 ) = CSP(03i x 03 2 ) 

We generalize the definition of products in the obvious way to infinite products. 
For a sequence of r-structures ($U)iei, the direct product 03 = ]^[ ig7 21; is the r- 
structure on the domain Yiiei ^ sucn that for R € r of arity fc 

{{a]) ieI , (Oi)i 6 z) € R B iff [a], ...,a^)e R %% for each % e I , 

and for / € r of arity fc, we have 

/ a> ((o j 1 ) i6J ,... J (a*) i6/ ) = (/ st «(a i 1 ,...,a i fc )) i67 . 

2.2. Mappings 

Throughout the text, we use the following conventions. When /: A — > B is a 
function, and S is a subset of A, then f(S) denotes the set {/(s) | s e S} C B. 
When t = (ti, . . . ,t k ) is a fc-tuple of elements of B, then f(t) denotes the tuple 
(f(ti),...,f(tk)). Moreover, we use the same convention for higher-ary functions 
/: B m -> B: when t 1 , . . . ,t m are fc-tuples of elements of B, then f(t l , . . . ,t m ) de- 
notes the fc-tuple (f(t{, . . . , £"'), . . . , f{t\, . . . , i™)) (that is, the fc-tuple is computed 
componentwise) . 

In the following, let 21 be a r-structure with domain A and 03 a r-structure with 
domain B. A homomorphism h from 21 to 05 is a mapping from A to B that preserves 
each function and each relation for the symbols in r; that is, 

• if (oi, . . . , a k ) is in i? a , then (h(at), ■ • ■ , h(a k )) must be in 
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. f <s (h(a 1 ),...,h(a k )) = h(f % (a 1 ,...,a k )). 
When A, B are algebras with the same signature and domain A, B, respectively, and 
/ is a homomorphism from A to B, then f(A) induces a subalgebra of B, and this 
subalgebra is called a homomorphic image of A. 

When a mapping h preserves a relation R, we also say that R is invariant under 
h. If h does not preserve R, we also say that h violates R. A homomorphism from 
21 to 03 is called a strong homomorphism if it also preserves the complements of the 
relations from 21. Injective strong homomorphisms are called embeddings. Surjective 
embeddings are called isomorphisms. A homomorphism from a substructure of 21 to 
03 is called a partial homomorphism from 21 to 03. An embedding from a substructure 
of 21 into 03 is called a partial isomorphism from 21 to 03. 

Homomorphisms and isomorphisms from 03 to itself are called endomorphisms 
and automorphisms, respectively. Structures where the identity is the only automor- 
phism are called rigid. When f : A —> B and g : B —> C, then g o f denotes the com- 
posed function x H ► g(f(x)). Clearly, the composition of two homomorphisms (em- 
beddings, automorphisms) is again a homomorphism (embedding, automorphism). 
Let Aut(2l) and End(2l) be the sets of automorphisms and endomorphisms, respec- 
tively, of 2t. The set Aut(2t) can be viewed as a group, and End(2l) as a monoid with 



respect to composition; more on that can be found in Section |3.3| and Section 3.4.2 



2.3. Formulas and Theories 

We assume familiarity with basic concepts of classical first-order logic; see for 
example |86| . In particular, we will use the concepts of conjunctive normal form 
(CNF), free and bound variables, terms and subterms, clauses, (positive and negative) 
literals, and atomic formulas. 

We always allow the first-order formula x — y (for equality) and _L (for 'false'), 
independently of the signature, unless stated otherwise. A formula without free vari- 
ables will be called a sentence. A (first-order) theory is a set of (first-order) sentences. 
A structure 03 is a model of a sentence cf> (or a theory T) if tfi (all sentences in T, 
respectively) holds true in 03; in this case we write 03 |= (j> (03 \— T). The set of 
all first-order sentences that are true in a given structure 03 is called the first-order 
theory of 03, and denoted Th(03). If a sentence or a theory has a model, we call it 
satisfiable. We state two basic facts that will be used later. 



Theorem 2.3.1 (Compactness; see Theorem 5.1.1 in 114] ). Let T be a first-order 



theory. Lf every finite subset ofT is satisfiable then T is satisfiable. 

When T is a theory and <j> a sentence, we say that T entails <f>, in symbols T \= <f>, 
if every model of T satisfies <f>. Two theories T\ , T2 are said to be equivalent if T\ |= T2 
and T 2 \=T X . 

Lemma 2.3.2 (see Lemma 2.3.2 in [114] ). Let T be a first-order r-theory, and <fi a- 
first-order t -formula with free variables X\ , . . . , x n . Let C\ , . . . , c n be distinct constants 
that are not in r. Then T \= <j){c\, . . . , Cfc) if and only if T \= Vxi, . . . , x n .(j>. 

Let 03 be a r-structure. When is a first-order r- formula, and when x\, . . . , x n is 
an ordered list that enumerates all the free variables, then (f>(xi, . . . , x n ) defines over 
03 the relation {(61, . . . , b n ) | 03 |= (j){bi, . . . , b n )}. When is a r-formula with free 
variables x\, . . . ,x n , and h is a fc-ary function then h preserves <fi if h preserves the 
n-ary relation that is defined by 0(xi, . . . ,x n ). We say that a structure 21 is (first- 
order) definable in 03 if 21 and 03 have the same domain, and every relation from 21 
has a first-order definition in 03. Two structures 21, 03 are (first-order) interdefinable 
if 21 is definable in 03 and vice versa. 
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A first-order T-formula <p is said to be 



quantifier-free if it does not contain any quantifiers; that is, it is built from 
the logical connectives A, V, the binary relation =, the (free) variables, 
and the symbols from r only (also see Section 3.6.1 1; 

in prenex normal form if it is of the form Q\Xi . . . Q n x n .ip where Qi G {V, 3} 
and ip is quantifier-free; 

Horn if it is written in conjunctive normal form and every clause has at 
most one positive literal (those formulas appear e.g. in Section 6.2); 
positive quantifier-free if <p is quantifier-free, and if in addition cp does not 
contain negation symbols -■. 

existential if it is of the form Bxi, . . . , x n . ip where ip is quantifier-free (those 
formulas appear e.g. in Section 3.6.2); 

universal if is of the form Vxi, . . . , x n . ip where ip is quantifier- free; 
3 + (existential positive) if it existential and if the quantifier-free part of 
<p does not contain any negation symbols (those formulas appear e.g. in 
Section [3~Q| ; 

V~ (universal negative) if it is of the form Va;i, . . . , x n . ->ip where ip is positive 
quantifier-free; 

universal conjunctive if it is universal and if the quantifier-free part of (p 
does not contain any negation or disjunction symbols (those formulas appear 
e.g. in Section 5.6); 

V3 (forall- exists) if it is of the form V?/i, . . . ,y m - ip where ip is existential 
(those formulas appear e.g. in Section 3.6.2); 

V3 + (positively restricted forall- exists) if it is of the form \fy.(p(y), where (p(y) 
is a positive boolean combination of quantifier-free formulas and existential 
positive formulas (those formulas appear throughout Section 3.6) 
primitive positive if it is of the form 3xi, . . . ,x n . ip\ A • • • A ip m , where 
ipi , . . . , ip m are atomic (they are of central importance in this thesis) . 



We could have equivalently defined positively restricted forall-exists formulas as 
conjunctions of universally quantified disjunctions of primitive positive formulas and 
negated atomic formulas. It is easy to see that every V3 + -formula can be re-written 
into such a formula. 

Note that homomorphisms preserve all existential positive formulas. An impor- 
tant property of primitive positive sentences </> is that 21x25 \= <p iff 21 |= <p an d 
23 |= (p. Also note that partial isomorphisms preserve quantifier-free formulas, embed- 
dings preserve existential formulas, and isomorphisms preserve first-order formulas. 
Embeddings that preserve all first-order formulas are called elementary. When 23 is 
an extension of 21 such that the identity map from 23 to 21 is an elementary embed- 
ding, we say that 23 is an elementary extension of 21, and that 21 is an elementary 
substructure of 23. 



TTi] ). Let 21 be a 



Theorem 2.3.3 (Lowenheim-Skolem; see Corollary 3.1.4 in 
T-structure, X a set of elements of%, and A a cardinal such that |r| + \X\ < X <\A\. 
Then 21 has an elementary substructure 23 of cardinality A with X C B. 



A first-order theory T is said to be existential if all sentences in T are existential, 
and the set of all existential r-sentences that is true in a T-structure 23 is called 
the existential theory of 23. Analogously, we define V3 + , V3, existential, universal, 
existential positive, and universal negative theories. 
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2.4. Diagrams 

We need the concept of a diagram of a structure, in various variants. The idea is 
to transform a structure into a formula, similarly as in the definition of the canonical 
query given in Section [l.2| 



Definition 2.4.1. Let 21 be a r-structure so that in 21 every element is named by 
a constant. Then 

• the set of all positive quantifier-free sentences that hold on 21 is denoted by 
diag + (QL), 

• the set of all quantifier-free sentences that hold on 21 is denoted by diag{$i), 

• the set of all universal negative sentences that hold on 21, is denoted by 

• the elementary diagram of 21 is the set of all first-order sentences true in 21, 
and is denoted by diagj a {%). 

The following is straightforward from the definitions. 

Lemma 2.4.2 (Diagram lemma; Lemma 1.4.2. in |114 ). Let 21 and^& be relational 



T-structures, and lettyj be (t Lip) -expansion o/2l by constant symbols p such that every 
element of 21' is named by a constant. Then the following are equivalent. 

(1) There is a (r U o~)-expansion 03' of 03 such that 03' |= rfiag + (2t'); 

(2) There is a homomorphism from 21 to 03. 

Diagrams are useful in the proof of the following elementary, but important 



lemma, which has been called the existential amalgamation theorem in 114 . The 
proof is analogous to the proof of Lemma |2.4.4| given in full detail below, and we 
therefore omit it. 



Proposition 2.4.3 (Theorem 5.4.1 in 114| ). Let 21 and 03 be T-structures with 



domain A and B, respectively. Suppose that a lists a subset S of A, and let h : S — > B 
be a partial homomorphism from 21 to 03 such that every existential sentence true in 
(03, h(a)) is also true in (21, a). Then there exists an elementary extension £ o/2l and 
an embedding g: 03 — > £ such that g(h(a)) = a. 

We will need several times a positive variant of Proposition 2.4.3[ which is explic- 
itly given in 114| , but without proof. 



Lemma 2.4.4 (Theorem 5.4.7 in 114| ). Let 21 and 03 be T-structures with domain 



A and B , respectively. Suppose that a lists a subset S of A, and let h : S — > B be a 
partial homomorphism from 21 to 03 such that every existential positive sentence true 
in (03, h(a)) is also true in (21, a). Then there exists an elementary extension £ o/2t 
and a homomorphism g: 03 — > £ such that g(h(a)) = a. 

Proof. Similar to the proof of 5.4.1 in [114] . Let a be a set of constant symbols 
and 03' be a (r U a)-expansion of 03 such that every element of 03' is denoted by some 
constant symbol from a. Let 21' be the expansion of 21 by those constant symbols 
c e a that denote an element h(a) in 03' for a £ S; the constant c then denotes a in 21'. 
It suffices to show that the theory T := diag fo (2t') U diag + (03') has a model £', since 



Lemma 2.4.2 then asserts the existence of a homomorphism from 03 to the r-reduct £ 
of £', which will be an elementary extension of 21. Moreover, such a homomorphism 
must map h(a) to a. 

If T has no model, then by the compactness theorem there is a conjunction 
4>(a,c) of finitely many sentences in diag + (03') such that 21 \= ~^3y.(p(a,y). Since 4> 
is quantifier-free and positive, the assumptions imply that 03 |= -<3y.(j>(a,y). This 
contradicts that ^(a, c) is true in 03. □ 
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We can now prove a generalization of the condition given in Proposition |1.3.4| 
from Section 1X731 that characterizes when two theories have the same CSP. 



Proposition 2.4.5. Let T and T' be r-theories. The following are equivalent. 

(1) Every model of T has a homomorphism to a model of T' , and every model 
of T' has a homomorphism to a model of T . 

(2) T and T' imply the same universal negative sentences. 

Proof. To prove the implication from (1) to (2), assume (1), and let be a 
universal negative sentence implied by T", and let £ be a model of T. By (1), there 
is a homomorphism from £ to a model 58 of T". Since —i<j> is equivalent to an ex- 
istential positive sentence, it is preserved by homomorphisms, and hence we have a 
contradiction to the assumption that T' \= <f). 

For the implication from (2) to (1), assume (2), and let 58 be a model of T. Let 
S be the existential positive theory of 58. We claim that S U T" is satisfiable. If not, 
then by compactness (Theorem |2.3.1 ) there is some finite subset {(f>i, ■ of S 



such that T' h {^<f>\ V • ■ ■ V -«f>k)- The formula -xf>i V ■ • • V -^<f>k is equivalent to a 
universal negative sentence ip, and T' h tp, so by (2) we have that T h ijj, and hence 
58 |= ip. We have reached a contradiction, since 58 \— <j>i for all? < k. So there indeed 



exists a model 21 of S U T' . Lemma 2.4.4 applied to 21 and 58 for the empty sequence 



a gives a model £ of T' U S and a homomorphism from 58 to £. □ 

This indeed proves Proposition 1 1 . 3 .4] since theories that imply the same universal 
negative sentences have obviously the same CSP. It is now also easy to prove Propo- 
sition |1.3.6| from Section |1.3[ characterizing those theories T for which there exists a 
structure 58 such that CSP(T) = CSP(58). We first show the following. 

Proposition 2.4.6. For any satisfiable theory T , the following are equivalent. 

(1) there exists a structure 58 that satisfies every existential positive sentence </) 
where T U {(/)} is satisfiable. 

(2) T has a model 58 that satisfies every existential positive sentence (j) where 
TU {(/)} is satisfiable. 

(3) For all existential positive sentences (f>i and 4>2, ifTU {4>i} is satisfiable and 
TU {4>2\ is satisfiable, then TU {<f>i,<f>2} is satisfiable as well. 



(4) T has the Joint Homomorphism Property (JHP - confer Proposition 1.3.6) 



PROOF. We prove (1) O (2), (2) <^ (3), (3) <^ (4). For the implication from 
(1) to (2), let T' be the universal negative theory of 58. By assumption, T' and T 



imply the same universal negative sentences, and hence by Proposition 2.4.5 there is 
a homomorphism from 58 to a model £ of T. This model ft has the desired property: 
if cj) is existential positive such that TU{(f>} is satisfiable, then 58 satisfies 4> and since 
homomorphisms preserve existential positive formulas, £ satisfies <fi as well. 

The implication from (2) to (1) and the implication from (2) to (3) are obvious. 
To show that (3) implies (2), assume (3). Let P be the set of all existential positive 
sentences <f> such that T U is satisfiable. By the assumption that T is satisfiable, 
and by (3), all finite subsets of TUP are satisfiable, so by compactness of first-order 



logic (Theorem 2.3.1 ) we have that TUP has a model 58. 

Now we show that (3) implies (4). Assume (3), and let r be the signature of T. 
Let 2li and 2I2 be models of T. We have to show that there exists a model 58 of T 
that admits homomorphisms from 2li and 2I2 ■ Let 2l' x and 2l 2 be expansions of 21 1 
and 2I2, respectively, where every element is denoted by a distinct constant symbol. 
Consider the theory T' := T U diag + (21^ ) U diag + (2t 2 ); we claim that T 1 is satisfiable. 



By compactness (Theorem 2.3.1), it suffices to show that every finite subset S of T" 



is satisfiable. Let Si := S fl diag + (2t) and S2 :— S diag + (2l2). By forming a finite 



42 



2. PRELIMINARIES IN LOGIC 



conjunction, we see that Si and S2 are logically equivalent to single sentences 0i and 
02, respectively. Certainly T U {0i} and T U {02 } are satisfiable since 21^ and 2l 2 
are expansions of models of T and therefore satisfy all sentences from T. By (3), the 
theory TU {01,02} is satisfiable as well. Therefore the claim is true, and there exists 



a model 03' of T' . Let 03 be the r-reduct of 03'. Finally, Lemma 2.4.2 asserts the 
existence of a homomorphism from 21 1 to 03 and from 2I2 to 03, which proves (4). 

For the implication from (4) to (3), suppose that T has the JHP, and that 0i and 
02 are existential positive sentences such that TU{0i} has a model 2li and TU{02} has 
a model 2I2 . By (4) , there exists a model 03 of T such that 21 1 and 21 2 homomorphically 
map to 03. Then 03 clearly satisfies TU {01,02} since homomorphisms preserve 
existential positive sentences. □ 

Note that in the statement and the proof above, the phrase existential positive 
can be used interchangeably with the phrase primitive positive. With the additional 



assumption that T has a finite relational signature, item (1) in Proposition 2.4.6 
becomes the statement that there exists a structure 03 such that CSP(03) and CSP(T) 
are the same problem, so we indeed proved in particular Proposition 1 1 . 3 . 6} 

2.5. Types 

A set of formulas $ with free variables x\,...,x n is called satisfiable over a 
structure 03 if there are elements b\ , . . . , b n of 03 such that for all sentences 6$ 
we have 03 |= 0(i>i, . . . , b n ). We say that $ is satisfiable if there exists a structure 03 
such that $ is satisfiable over 03. For n > 0, an n-type of a theory T is a set p of 
formulas with free variables X\, . . . ,x n such that p U T is satisfiable. An n-type p of 
T is maximal if T U p U 4>{x\, . . . , x n ) is unsatisfiable for any formula ^ T U p. The 
set of all complete n- types of Th(2l) is denoted by S„. An n-type of a structure 03 is 
an n-type of the first-order theory of 03. 

In a similar manner, an existential positive n-type (ep-n-type) of a theory T is 
a set of existential positive formulas p with free variables x±, . . . , x n such that pUT 
is satisfiable. A ep-n-type p of T is maximal if T U p U 0(xi, . . . , x n ) is unsatisfiable 
for any existential positive formula ^ T U p. A fep-j n-type of a structure 21 is 
a (ep-) n-type of the theory Th(2t). When S is the universal negative theory of 21, 
then note that p U Th(2t) is satisfiable if and only if p U 5 is satisfiable; thus we could 
equivalently have defined ep-n-type with respect to the latter theory. 

When p is an n-type, and I C {l,...,n} with |/| = fc, then the subtype of p 
induced by I is the fc-type obtained from p by existentially quantifying in all formulas 
in p the variables Xi for i £ {1, . . . , n} \ /, and then renaming the variables in the 
resulting set of formulas to x\, . . . , Xk in a way that preserves the order on the indices 
of the variables. 

An n-type p of 21 is realized in 21 if there exist a\, . . . ,a n G A such that 21 |= 
0(ai, . . . , a n ) for each £ p. The set of all first-order formulas with free variables 
xi, . . . , x n satisfied by an n-tuple a — (ai, . . . , a„) in 21 is a maximal type of 21, and 
called the type of a, and denoted by tp a (a). 

For an infinite cardinal k, a structure 21 is n-saturated if, for all /3 < K and 
expansions 21' of 21 by at most (3 constants, every 1-type of 21' is realized in 21'. We 
say that an infinite 21 is saturated when it is |^4|-saturated. Realization of pp-types 
and pp-(ft)-saturation are defined analogously. 

Theorem 2.5.1 (Corollary 8.2.2 in |114| ). Let t be a signature and A > |r|. Then 
every r-structure 03 has an \ + -saturated elementary extension of cardinality < \B\ X . 



CHAPTER 3 



Model Theory 




Hodges 114 writes that "model theory is about the classification of mathematical 
structures, maps and sets by means of logical formulas". This text is about the 
computational complexity of constraint satisfaction problems for infinite structures 25 
— and since the constraint satisfaction problem of *B (and in particular its complexity) 
is fully determined by the first-order theory of *B, it is not surprising that model theory 
has a great deal to say about constraint satisfaction problems. 

Many important infinite-domain constraint satisfaction problems can be formu- 
lated with templates that are u -categorical. The concept of w-categoricity is of central 
importance in model theory, and for reasons that will become clear in Section [XT] of 
this chapter, also in permutation group theory. From a model-theoretic perspective, 
w-categoricity is a very strong assumption - but still many problems that have been 
studied in the literature, in particular constraint satisfaction problems for qualita- 
tive reasoning formalisms in artificial intelligenc^J can be formulated as CSPs with 
w-categorical templates. We will also see that every connected monotone monadic 
SNP sentence (the corresponding problems have been called forbidden patterns prob- 
lems and studied in |147 - |149] ) describes a constraint satisfaction problem of an 
w-categorical structure (Section 4.5). 



In this section we present general results about w-categorical structures: for ex- 



ample how to construct them (Sections 3.1 and 3.2 and 3.5 1, and how to algebraically 



lr The question which reasoning formalisms in Artificial Intelligence should be called qualitative 
has been the topic of scientific discussion [143] . My own response to this question is: it is qualitative 
if and only if it can be formulated with an oj-categorical template. 
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characterize syntactically restricted definability of relations over w-categorical struc- 



tures (Sections 3.3 3.4 and 3.6). Section 3.5.2 gives an exact characterization of those 
constraint satisfaction problems that can be formulated with w-categorical templates. 

There is already an excellent literature on w-categoricty: most notably, the book 
by Cameron 65 , the recent survey by Macpherson 14 6], a nd the collection 125 . 
Moreover, classical text-books on model theory, such as [114 127 151 , always treat 
w-categority, and use w-categorical structures as a rich source of examples. The 
present chapter is different from those in that it focusses on techniques and facts 
that will be relevant for complexity classification for the corresponding constraint 
satisfaction problems. It contains many results that are not contained in any of the 
sources mentioned above (and which have been published in 33 36 - 38||44||46 48 
108| ). Some parts can be derived from proofs in Hodges' book [114] and its original 



longer version 113 ; our citation policy is to give the reference to the shorter version, 



whenever this is possible, since it is more widely accessible. When this is not possible, 
for what is relevant in this text, the long version subsumes the short 



we quote 
version. 
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3.1. w-categorical Structures 

"Every statement about all uj- categorical structures is either trivial, or false." 

(Martin Ziegler, 2005) 



A satisfiable first-order theory T is called uj -categorical if all countable models 
of T are isomorphic. A structure is called uj- categorical if its first-order theory is ui- 
categorical. Since almost all w-categorical structures that appear in this text will be 
countably infinite, we make the convention that w-categorical structures are count- 
ably infinite. One of the first structures that were found to be w-categorical (by 
Kantor [66] ) is the linear order of the rational numbers (Q; <), which we will use as 
a running example in this section. We will see many more examples of w-categorical 
structures in this section, in Section |3.2[ and in Chapter [4] One of the standard 
approaches to verify that a structure is w-categorical is via a so-called back- and- forth 
argument. To illustrate, we give the back-and-forth argument that shows that (Q; <) 
is w-categorical; much more about this important concept in model theory can be 



found in 114 168 



PROPOSITION 3.1.1. The structure (Q; <) is uj- categorical. 

Proof. Let 21 be a countable model of the first-order theory T of (Q; <). It is 
easy to verify that T contains (and, as this argument will show, is uniquely given by) 

• Bx. x — x (no empty model) 

• Vx, y, z ((x < y A y < z) =4> x < z) (transitivity) 

• Vx, y. -i(x < x) (irreflexivity) 

• Vx, y(x<yVy<xVx = y) (totality) 

• Vx By. x < y (no largest element) 

• Vx By. y < x (no smallest element) 

• Vx, z By (x < y A y < z) (density). 

An isomorphism between 21 and (Q; <) can be defined inductively as follows. 
Suppose that we have already defined / on a finite subset S of Q and that / is an 
embedding of the structure induced by S in (Q; <) into 21. Since < a is dense and 
unbounded, we can extend / to any other element of Q such that the extension is 
still an embedding from a substructure of Q into 21 (going forth). Symmetrically, for 
every element v of 21 we can find an element u G Q such that the extension of / that 
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maps u to v is also an embedding (going back). We now alternate between going forth 
and going back; when going forth, we extend the domain of / by the next element of 
Q, according to some fixed enumeration of the elements in Q. When going back, we 
extend / such that the image of A contains the next element of 21, according to some 
fixed enumeration of the elements of 2t. If we continue in this way, we have defined 
the value of / on all elements of Q. Moreover, / will be surjective, and an embedding, 
and hence an isomorphism between 21 and (Q; <). □ 

A second important running example of this section is the random graph (V; E) , 
which is a (simple and undirected) graph with a countably infinite set of vertices V 
that is defined uniquely up to isomorphism by the following extension property: for 
all finite disjoint subsets U, U' of V there exists a vertex v € V \ (U U U') such that v 



is adjacent to all vertices in U and to no vertex in U' . We will see in Section 3.2 that 
such a graph indeed exists (Theorem 3.2.2[ ). 



Proposition 3.1.2. The random graph (V;E) is ui-categorical. 

Proof. Note that the defining property of (V;E) given above is a first-order 
property; a simple back-and-forth argument shows that every countably infinite graph 
with this property is isomorphic to (V; E) . □ 

The theorem of Ryll-Nardzewski. There are many equivalent characteriza- 
tions of w-categoricity; the most important one is in terms of the automorphism 
group of 58. In the following, let ? be a set of permutations of a set X. We 
say that is a permutation group if for g, f G §f the functions x H > g(f{x)) and 
x i y g~ l (x) are also in For n > 1 the orbit of (t\, . . . , t n ) £ X n under Sf is the set 
{(a(ti),..., a(t„)) | a £ Sf}. 

Definition 3.1.3. A permutation group & over a countably infinite set X is 
oligomorphic if has only finitely many orbits of n-tuples for each n > 1 . 

An accessible proof of the following theorem can be found in Hodges' book (The- 
orem 6.3.1 in 114| ). 



Theorem 3.1.4 (Engeler, Ryll-Nardzewski, Svenonius). For a countably infinite 
structure 58 with countable signature, the following are equivalent: 

(1) 58 is lu -categorical; 

(2) the automorphism group Aut(58) o/58 is oligomorphic; 

(3) for each n > 1, there are finitely many inequivalent formulas with n free 
variables over 58; 

(4) for all n > 1, every set of n-tuples that is preserved by all automorphisms of 
58 is first- order definable in 58. 



The fourth of those conditions is missing in Theorem 6.3.1 of 114 ; but the 
implication from (3) to (4) follows from the proof given there. Conversely, suppose 
that Aut(58) are infinitely many orbits of n-tuples, for some n. Then the union of 
any subset of the set of all orbits of n-tuples is preserved by all automorphisms of 58 ; 
but there are only countably many first-order formulas over a countable language, so 
not all the invariant sets of n-tuples can be first-order definable in 58. 



The second condition in Theorem 3.1.4 provides another possibility to verify that 



a structure is w-categorical. We again illustrate this with the structure (Q;<). It 
is not difficult but a good exercise to verify that the orbit of an n-tuple (ii,... ,t n ) 
from Q™ with respect to the automorphism group of (Q; <) is determined by the weak 
linear order induced by (t\, . . . ,t n ) in (Q; <). We write weak linear order, and not 
linear order, because some of the elements t\, . . . , t n might be equal. That is, a weak 
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linear order is a total quasiorder. There are less than n n such orders, and hence the 
automorphism group of (Q; <) has a finite number of orbits of n-tuples, for all n > 1. 

Lemma [3. 1.5| below states a useful property that w-categorical structures have in 
common with finite structures, and is an easy consequence of Konigs tree lemma. 

Lemma 3.1.5. Let 58 be a finite or an infinite uj- categorical structure with rela- 
tional signature t, and let 21 be a countable T-structure. If there is no homomorphism 
(embedding) from 21 to 58, then there is a finite substructure o/2l that does not ho- 
momorphically map (embed) to 58. 

Proof. We present the proof for homomorphisms; the proof for embeddings is 
analogous. Suppose every finite substructure of 21 homomorphically maps to 58. We 
show the contraposition of the lemma, and prove the existence of a homomorphism 
from 21 to 58. Let 01,02, • • • be an enumeration of 21. We construct a rooted tree 
with finite out-degree, where each node lies on some level n > 0. The nodes on 
level n arc equivalence classes of homomorphisms from the substructure of 21 induced 
by ai,...,a n to 58. Hence, there is only one vertex on level 0, which will be the 
root of the tree. Two such homomorphisms / and g are equivalent if there is an 
automorphism a of 58 such that fa = g. Two equivalence classes of homomorphisms 
on level n and n + 1 are adjacent, if there are representatives of the classes such 
that one is a restriction of the other. Theorem |3.1.4 asserts that 21 has only finitely 



many orbits of fc-tuples, for all k > (clearly, this also holds if 58 is finite). Hence, 
the constructed tree has finite out-degree. By assumption, there is a homomorphism 
from the structure induced by a\, a^, ■ ■ ■ , a n to 58 for all n > 0, and hence the tree 
has vertices on all levels. Konig's lemma asserts the existence of an infinite path in 
the tree, which can be used to inductively to define a homomorphism h from 21 to 58 
as follows. 

The restriction of h to {ai, . . . , a n } will be an element from the n-th node of the 
infinite path. Initially, this is trivially true if h is restricted to the empty set. Suppose 
h is already defined on a\, . . . , a n , for n > 0. By construction of the infinite path, 
we find representatives h n and h n +i of the n-th and the (n + l)-st element on the 
path such that h n is a restriction of h n +i- The inductive assumption gives us an 
automorphism a of 21 such that a(h n (x)) — h(x) for all x G {ai,...,a„}. We set 
h(a n+ i) to be a(h n+ i(a n+ i)) . The restriction of h to a\, . . . , a„ +1 will therefore be 
a member of the (n + l)-st element of the infinite path. The operation h defined in 
this way is indeed a homomorphism from 21 to 58. □ 

The assumption that 21 is countable is necessary in Lemma |3.1.5| consider for 
example 21 := (R; <), which does not admit a homomorphism to 58 := (Q; <) for 
cardinality reasons, even though any finite substructure of 21 does. 

Corollary 3.1.6. For any structure £, there is a finite structure 58 with the 
same CSP as £ if and only if the core of <L is finite. 

Proof. If there exists a finite structure 58 with the same CSP as £, then ev- 
ery finite substructure of £ homomorphically maps to the core 58' of 58, and by 
Lemma |3.1.5 there exists a homomorphism from £ to the finite core structure 58' 



(which is unique up to isomorphism); since 58' also maps to £, it is a core of C The 
converse is trivial. □ 



Corollary 3.1.7. Two countable uj- categorical relational r-structures 21 and 58 
have the same CSP if and only if there is a homomorphism from 21 to 58 and a 
homomorphism from 58 to 21. 
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Corollary |3.1.7| is false for general countable relational structures. Consider for 
example the structure (Z; {(x, y) \ y = x + 1}) — the 'infinite line', and the structure 
(N; {(x, y) | y = x + 1}) — the 'infinite ray'. Clearly, these two structures give rise to 
the same CSP, but there is no homomorphism from the line to the ray. 



Several times we need variants of Lemma 3.1.5 that can be proved in the same 
way. For instance, we can replace homomorphism in the statement and the proof by 
strong homomorphism, or injective homomorphism, or mappings satisfying universal 
identities such as Vx,y. f{x,y) — f(y,x). What is common for all those statements 
is that the respective property of the function can be expressed by a universal first- 
order sentence. We make this more precise and derive the following generalization of 
Lemma |3. 1 . 5| based the compactness theorem. 

Lemma 3.1.8. Let 03 be a finite or an infinite u>- categorical structure with re- 
lational signature r , let 21 be a countably infinite r-structure, and let a be a set of 
function symbols. Then for any universal (rU a) -theory T the following are equivalent. 

(1) The two-sorted structure (21, 03) has a (t p U a) -expansion that satisfies T such 
that every f <E a denotes a function from 21 to 03 . 

(2) For every finite induced substructure £ of 21 the two-sorted structure (£, 03) 
has a (t U a) -expansion that satisfies T such that every f G a denotes a 
function from £ to 03 . 

Proof. Any substructure of a model of a universal theory is again a model of 
the theory, so (1) implies (2). Conversely, we prove the existence of a homomorphism 
from 21 to 03 by a compactness argument as follows. Let Pa and Pb be two unary 
relation symbols not contained in r. Let 21' be an expansion of 21 by countably many 
constants such that every element of 21 is named by a constant symbol; let r' be the 
(countable) signature of 21'. Let R be the diagram of 21', and let S be a set of universal 
first-order sentences that 

• says that Pa and Pb partition the set of all elements, 

• forces that all function symbols from a denote functions from the elements 
in Pa to the elements in Pb, and 

• expresses that the r-reduct of the structure induced by the elements of Pb 
has the same first-order theory as 03. 

We first prove that R U S U T is satisfiable. By compactness, it suffices to prove 
satisfiability of R' U S U T for all finite subsets of R' . Let c\, . . . , c n be the constant 
symbols mentioned in R'. Let £ be the structure induced by {ci, . . . , c„} in 21. Clearly, 
€\= R'. By assumption, the structure (£,03) with the two sorts Pa and Pg can be 
expanded to a (two-sorted) (a U restructure D that satisfies T; it is straightforward 
to see that 2) satisfies the sentences in S. When we name the elements in the first 
sort Pa by the corresponding constants c\, . . . , c„, then the expansion satisfies also 
R! , and so we have found a model of R' U S U T. 



So there exists an (infinite) model of R U S U T, and by Theorem 2.3.3 and since 
r' is countable there is also a countably infinite model 971 of R U S U T. Consider the 
substructure of 971 generated by the elements in Pa, and all the elements in Pg. It can 
be checked that the resulting structure 97t' still satisfies R and S, and since universal 
sentences are preserved by taking substructures, 971' also satisfies T. Note that in 
971', the elements from Pa induce a copy of 21, and the elements from Pg induce a 
structure that is isomorphic to 03, since 03 is finite or w-categorical. The functions 
denoted by the function symbols from a provide the required (a U r)-expansion of 
(21,03). □ 
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Lemma |3.1.8| is indeed a generalization of Lemma |3.1.5| to make sure that / is a 
homomorphism, T contains for every relation symbol R £ r the sentence Wx.(R(x) => 
R(f(x)). 

First-Order Interpretations. Many w-categorical structures can be derived 
from other w-categorical structures via first-order interpretations (our definition fol- 
lows [114] ). 

If S(x\, . . . , Xfe) is a first-order r-formula with k free variables x%, . . . , Xk, and 21 
is a r-structure, we we write S(^l k ) for the fc-ary relation that is defined by 5 on 21. 

Definition 3.1.9. ^4 relational a -structure 03 /ias a (first-order) interpretation J 
in a t -structure 21 if there exists a natural number d, called the dimension of I, and 

• a r-formula 5i{x\, . . . , xj) - called domain formula, 

• for each atomic a -formula 4>(yi, ■ . ■ , yt) o r-formula (f>i(xi, ■ . ■ , Xk) where the 
Xi denote disjoint d-tuples of distinct variables - called the defining formulas, 

• a surjective map h : <5j(2l d ) — > B - called coordinate map, 
such that for all atomic a -formulas <f> and all tuples Hi € <5j(2l d ) 

03 (=0(h(oi),...,/i(ajO) O 2lh0/(ai,...,a fe ) . 

If the formulas Si and <pi are all primitive positive, we say that the interpretation 
I is primitive positive. Note that the dimension d, the set S := <5(2l fc ), and the 
coordinate map h determine the defining formulas up to logical equivalence; hence, 
we sometimes denote an interpretation by I = (d, S, h) . Note that the kernel of h 
coincides with the relation defined by (x = y)j, for which we also write =/, the 
defining formula for equality. 

We say that 03 is interpretable in 21 with finitely many parameters if there are 
c\ , . . . , c„ € A such that 03 is interpretable in the expansion of 21 by the singleton 
relations {cj} for all 1 < i < n. A first-order definition of one structure in another 
is in model theory often the special case of an interpretation / where =/ is simply 
the equality relation; in this text, however, we say that a structure 03 is (first-order) 
definable in 21 if 03 has a 1-dimensional interpretation / in 03 where =j is the equality 
relation and the domain formula Si is logically equivalent to true. 



Lemma 3.1.10 (see Theorem 7.3.8 in 113 ). Let 21 be an uj- categorical struc- 
ture. Then every structure 03 that is first-order interpretable in 21 with finitely many 
parameters is lo- categorical or finite. 

Note that in particular all reducts of an w-categorical structure and all expansions 
of an w-categorical structure by finitely many constants are again w-categorical. 



Example 3.1.11. In Section [L5| we have described Allen's Interval Algebra for 
temporal reasoning in Artificial Intelligence [5] , and the corresponding CSP. Formally, 
it is easiest to describe the template 21 for this CSP by a first-order interpretation 
/ in (Q; <). The dimension of the interpretation is two, and the domain formula 
5 1 (x, y) is x < y. Hence, the elements of 21 can indeed be viewed as non-empty closed 
intervals [x,y] over Q. The template 21 contains for each inequivalent {<}-formula 
4> with four variables a binary relation R such that (01,02,03,04) satisfies <f> if and 
only if ((01,02), (03,04)) G R. In particular, 21 has relations for equality of intervals, 
containment of intervals, and so forth. By Lemma [3.1.10[ 21 is w-categorical. □ 

3.2. Fraisse Amalgamation 

A versatile tool to construct cj-categorical structures is Fraisse-amalgamation. We 
present it here for the special case of relational structures] this is all that is needed in 
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the examples we are going to present. For a stronger version of Fraisse-amalgamation 



for classes of structures that might involve function symbols, see 114 . 

In the following, let r be a countable relational signature. The age of a r-structurc 
21 is the class of all finite r-structures that embed into 21. Let 03i, 032 be r-structures 
such that 21 is an induced substructure of both 03 1 and 032 and all common elements 
of 03 1 and 032 are elements of 21; note that in this case 2t = 03 1 n 032- Then we call 
03i U 032 the free amalgam of 03i,032 over 21. More generally, a r-structure £ is an 
amalgam o/03i and 032 over 21 if for i = 1,2 there are embeddings fa of 03^ to £ such 
that /1 (a) = / 2 (a) for all a e 21. 

Definition 3.2.1. An isomorphism-closed class C of r-structures has the amal- 
gamation property if for all 21, 03i, 03 2 € C with 21 = 03i n 03 2 there is a £ & C that 
is an amalgam o/03i and 032 over A. A class of finite r-structures that contains at 
most countably many non-isomorphic structures, has the amalgamation property, and 
is closed under taking induced substructures and isomorphisms is called an amalga- 
mation class. 

Note that since we only look at relational structures here (and since we allow 
structures to have an empty domain), the amalgamation property of C implies the 
joint embedding property (JEP) for C, which says that for any two structures 03i, 032 
there exists a structure £ that embeds both 03 1 and 032- 



A structure 21 is homogeneous (sometimes also called ultra-homogeneous 114 ) if 
every isomorphism between finitely generated substructures of 21 can be extended to 
an automorphism of 21. 



Theorem 3.2.2 (Fraisse 91|92 ; see 114| ). Let t be a countable relational signa- 



ture and letC be an amalgamation class of r-structures. Then there is a homogeneous 
and at most countable r-structure £ whose age equals C. The structure £ is unique 
up to isomorphism, and called the Frai'sse-limit of C. 

Example 3.2.3. Let C be the class of all linear orders. Then C is clearly closed 
under isomorphisms and induced substructures, and has countably many isomorphism 
types. To show that is also has the amalgamation property, let 21, 03i,032 G C, and 
ei : 21 — >■ 03i and e-i : 21 — > 032 be embeddings. Since C is closed under isomorphisms, 
we can assume without loss of generality that e\ and ei are the identity function. Let 
£ be the free amalgam of 03i and 032 over 21. Then the transitive closure of £ is in C, 
and is an amalgam (but not a free amalgam) of 03i and 032 over 21. It follows that C 
is an amalgamation class. By homogeneity, the Frai'sse-limit of C is unbounded and 



dense, and hence isomorphic to (Q; <) by Proposition 3.1.1 □ 



Example 3.2.4. Let C be the class of all finite partially ordered sets. Again, 
amalgamation can be shown by computing the transitive closure. The Frai'sse-limit 
is called the homogeneous universal partial order. □ 

Example 3.2.5. Let C be the class of all finite graphs. It is even easier than in 
the previous examples to verify that C is an amalgamation class, since here the free 
amalgam itself shows the amalgamation property. We can use homogeneity to verify 
that the Frai'sse-limit of C satisfies the defining property of the random graph (V; E) 



(the existence of the random graph was left open in Section 3.1 ). □ 



Example 3.2.6. Henson 110 used Frai'sse limits to construct 2" many w-categorical 
directed graphs. A tournament is a directed graph without self-loops such that for 
all pairs x, y of distinct vertices exactly one of the pairs (x, y), (y, x) is an arc in the 
graph. Note that for all classes N of finite tournaments, Forb(A/") is an amalgamation 
class, because if 2li and 2I2 are directed graphs in Forb(A/") such that 21 = 2ti n 2I2 is 
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an induced substructure of both 21 1 and 21 2 t then the free amalgam 21 1 U 2l 2 is also in 
Forb(AA). 

Hcnson in his proof specified an infinite set T of tournaments Ti,T2, . . . with 
the property that %i does not embed into %j if i =/= j; the set T will be described in 



Section 11.3 Note that this property implies that for two distinct subsets A/"i and 
A/2 of T the two sets Forb(A/i) and Forb(A/2) are distinct as well. Since there are 2 U 
many subsets of the infinite set T, there are also that many distinct homogeneous 
(and therefore w-categorical) directed graphs; they are often referred to as Henson 
digraphs. □ 

The structures from Example |3.2.6| can be used to prove various negative results 
about homogeneous structures with finite signature, for instance in Section [8. 4| and in 



Section [il.3| A better behaved class of structures are homogeneous structures whose 



age is finitely bounded (this is the same terminology as in 146 ) . 

Definition 3.2.7. We say that a class C of finite relational r-structures (or a 
structure with age C) is finitely bounded if r is finite and there exists a finite set of 
finite t -structures M such that C — Forb(A/"). 

Proposition 3.2.8. When 23 is finitely bounded, then CSP(23) is in NP. 

Proof. The problem CSP(23) is in monotone SNP (Section O). □ 



Frai'sse's theorem can be used to construct w-categorical structures, because ho- 
mogeneous structure with finite relational signature are w-categorical. More generally, 
we have the following. 

Lemma 3.2.9. Let £ be a countably infinite homogeneous structure such that for 
each k only a finite number of distinct k-ary relations can be defined by atomic for- 
mulas. Then £ is w- categorical. 

Proof. By homogeneity of £, the atomic formulas that hold on the elements of 
t in £ determine the orbit of t in Aut(2i). Since there are only finitely many such 
atomic formulas, it follows that there are finitely many orbits of /c-tuples in Aut(£). 
The claim follows by Theorem |3.1.4| □ 

It is sometimes convenient to define an w-categorical r-structure 23 by speci- 
fying an amalgamation class C with a signature that is larger than r such that 25 
is a reduct of the Frai'sse- limit of C. If the Fraisse-limit of C satisfies the condi- 
tion of Lemma |3.2.9| it will be w-categorical, and therefore also all its reducts are 



cj-categorical (Lemma 3.1.10). This method is for instance used in Section 4.1 



This technique has been used in 116 to give a new proof of a theorem due to 



Cherlin, Shelah, and Shi. The result appears in 70 for the special case where r 



has a single binary relation denoting the edge relationship of undirected graphs. The 



statement for general relational signatures r also follows from a result of 74 . The 
theorem of Cherlin, Shelah, and Shi will be useful in Section [475] 

Let A^ be a finite set of finite structures with a finite relational signature r. Recall 
that a r-structure 93 is called M -free if there is no homomorphism from any structure 
in N to 05. A structure 21 in a class of structures C is called universal for C if it 
contains all structures in C as an induced substructure. Recall that a structure is 
connected if it can not be given as the disjoint union of non-empty structures. 



Theorem 3.2.10 (of |70| ; also see 116| ). LetM be a finite set of finite connected 



r-structures. Then there is an lj -categorical M -free r-structure 58 that is universal 
for the class of all countable M -free structures. The structure 53 can be expanded by 
finitely many primitive positive definable relations so that the expanded structure is 
homogeneous. 
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We want to remark that the structure 03 from Theorem 3.2.10 is uniquely (up to 
isomorphism) given by the fact that it is A/"- free, universal for the class of all finite 
7V-frcc graphs, and that it can be expanded by existentially definable relations so that 



the expansion is homogeneous; we come back to this in Section 3.6.2 (Theorem 3.6.8) 



3.3. Oligomorphic Permutation Groups 



We have seen in Section |3.1| that a structure is w-categorical if and only if its 
automorphism group is oligomorphic, i.e., has for each n > 1 only finitely many orbits 
of n-tuples. This section describes this connection between logic and permutation 
groups in more detail. 

3.3.1. Local Closure. Automorphism groups of relational structures 03 have 
the property that they are locally closed, in the following sense. 

Definition 3.3.1. A set of permutations & of a set B is called (locally) closed 
if 2? contains all permutations p with the property that for every finite subset A of B 
there exists q G such that p(x) = q{x) for all x G A. The (local) closure of a set 
of permutations is the smallest locally closed set of permutations that contains 2? . 

Proposition 3.3.2. Let & be a set of permutations of some base set B. Then 
the following are equivalent. 

(1) 2P is the automorphism group of a relational structure; 

(2) 2? is a locally closed permutation group; 

(3) is the automorphism group of a homogeneous relational structure. 

In the proof of this proposition, the following concept is useful. When J?" is a 
subset of B B, then Inv(jF) denotes the set of all relations over B that are invariant 
under all functions from JF. A relational structure over the base set B whose relations 
are exactly the relations from Inv(J£") is called a canonical structure^ior J?. 

Proof of Proposition 13.3.21 For the implication from (1) to (2), let 5? be the 
automorphism group of a relational structure 03 with domain B, and let p G B — > B 
be such that for each finite subset A there exists age ^ such that p(x) = q(x) for 
all x G A. Then p must preserve all relations from 03, because if p violates a relation 
from 03, then this can be seen from the restriction of p to a finite subset of the domain. 
Hence, p G 

For the implication from (2) to (3), first note that canonical structures 03 for & 
are homogeneous: when i is an isomorphism between finite substructures of 03, say 
i has domain {ax, . . . ,a n }, consider the relation {(p(ai), . . . ,p{a n )) \ p G This 
relation is preserved by all operations in 3 s and hence belongs to the relations of 03. 
Thus, i preserves this relation, and (i(ai), . . . , i(a n )) — {p(a\), . . . ,p{a n )) for some 
p G This shows that there is an automorphism of 03 that extends i. In fact, since 
& is locally closed, this also shows that every automorphism of 03 is from & . 

The implication from (3) to (1) is trivial. □ 

3.3.2. The Inv-Aut Galois Connection. When 03 is a relational structure, 
we denote by (03) f the set of all relations that are first-order definable in 03. We will 
see in this section that the set 

{(03)f o | 03 first-order definable in £} , 



2 Here, we slightly deviate from the definition given in 65 , which only includes a fc-ary relation 



for each orbit of fc-tuples, for all k. The difference doesn't matter here, but becomes important in 
later sections. 
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partially ordered by inclusion, is closely connected to the set of all locally closed 
permutation groups that contains the automorphisms of C, again partially ordered by 
inclusion; the connection is strongest when €. is w-categorical. 

Recall that the automorphism group of a relational structure *B, i.e., the set of 
all automorphisms of 05, is denoted by Aut (03). In the following it will be convenient 
to define the operator Aut also on sets 1Z of relations over the same domain B, in 
which case Aut (1Z) denotes the set of all permutations a of B such that a and its 
inverse a~ l preserve all relations form TZ. This convention is clearly consistent in the 
sense that Aut (03) equals Aut(7?.) when 58 is a relational structure whose relations 
are exactly the relations from 1Z. 

Definition 3.3.3. An (anti-tone) Galois connection is a pair of functions F : 
U — > V and G : V — > U between two posets U and V , such that v < F(u) if and only 
if u < G(v) for all u <E U,v G V . 



It follows immediately from F(u) < F(u) and Definition 3.3.3 that u < G(F(u)) 
for all u £ U, and similarly that F(G(v)) > v for all v S V. Moreover, F(u) = 
F(G(F(u))) and G(v) = G(F(G(v))) for all u € U, v € V. The following is obvious. 

Proposition 3.3.4. The operators Inv and Aut form a Galois connection between 
sets of relations over the base set B, and subsets of B — » B, both partially ordered by 
inclusion. 

For arbitrary sets of permutations 5? of the domain B, and sets 1Z of rela- 
tions over the domain B, we now present descriptions of the closure operators £P H> 
Aut(Inv(^)) and 1Z H> Inv (Aut (7Z)). We start with the former. 

Definition 3.3.5. When & is a set of permutations of a set B, then the per- 
mutation group generated by is the smallest permutation group that contains 3? . 
Further, the local closure of 2? , denoted by is the smallest locally closed permu- 

tation group that contains £P . 



The following is not hard to show, and a special case of Corollary 1.9 in 183 
(which will be presented in full generality in Proposition 5.2.1 of Chapter [5]). 



Proposition 3.3.6. Let & be a set of permutations of a set B. Then for every 
permutation a of B , the following are equivalent. 

(1) a is in the local closure of the permutation group generated by 2? ; 
(2) 

(3) a G Aut(Inv(^)). 
In particular, 

Aut(Inv(^)) = 



Proof. The implication from (1) to (2) follows directly from the fact that Inv 
and Pol form a Galois connection (Proposition |3. 3.4 ). 



To show that (2) implies (3), let a be from <JP), and let R be from Inv(^). We 
have to show that p preserves R. Let t € R; since p is in the closure the group gen- 
erated by we have that at — ■ ■ {(ik{t)) ■ ■ ■) for some permutations /3i, . . . , flu 
from 3P. Since /3i, . . . , 0k preserve R, we have that at e R. 

Finally, to show that (3) implies (1), let p be from Aut(Inv( &)). Let & be the 
group generated by It suffices to show that for every finite subset {oi, . . . , a n } 
of B there is a £ <S such that a(a^) = 0{a i ) for all i < n. Consider the relation 
{(0(ti), . . . , 0(t n )) | G ^}. It is preserved by all permutations in & . Therefore, a 
preserves this relation, and so there exists £ & as required. □ 
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We now turn to characterizations of the hull operator 03 H> Inv(Aut(03)). We 
first observe the following, which is straightforward to prove. 

Proposition 3.3.7. Let 03 be any structure. Then Inv(Aut(03)) contains (03) fo , 
the set of all relations that are first-order definable in 03. 

An exact characterization of 03 i-> Inv(Aut(03)) can be given when 03 is cu- 
categorical. The analogous statement of Proposition |3.3.8| below has been observed 
for finite structures 03 by Krasner 132| . The fact that it extends to w-categorical 



structures is a direct consequence of the Theorem of Ryll-Nardzewski (Theorem 3.1.4) 



Proposition 3.3.8. Let 03 be a countable ui- categorical structure with base set 
B, and let R C B k be a relation. Then R is first-order definable in 03 if and only if 
R is preserved by the automorphisms of*B, in symbols, 



Inv(Aut(03)) = (03; 



fo 



As we have seen in the proof of Proposition 3.3.2| it follows in particular that 



the expansion of every w-categorical structure by all first-order definable relations is 
homogeneous. Recall that Theorem |3.1.4| even states that for countable structures 
the conclusion in Proposition |3 . 3. 8] holds if and only if 21 is w-categorical. 

We have the following consequence of Proposition |3.3.6| and Proposition |3.3.8| 
An anti-isomorphism between two posets U and V is a bijection / from the elements 
of U to the elements of V such that u < v in U if and only if f(u) > f(v) in V. 

Corollary 3.3.9. Let € be a countable uj- categorical structure. Then we have 
the following. 

• The set of sets of the form (03) f OJ where 03 is first-order definable in £, 
ordered by inclusion, forms a lattice. 

• The set of locally closed permutation groups that contain Aut(C), ordered by 
inclusion, forms a lattice. 

• The operator Inv is an anti-isomorphism between those two lattices, and Aut 
is its inverse. 

We explicitly state another consequence. Recall that two structures 03, £ on the 
same domain are said to be first- order interdefinable iff all relations of 03 have a 
first-order definition in £ and vice-versa. Then it follows from the above that two uj- 
categorical structures are first-order interdefinable if and only if they have the same 
automorphisms . 

3.3.3. Transitivity and Primitivity. We define concepts from permutation 
group theory that will be needed later. A permutation group Slona set B is 

• k-transitive if for any two fc-tuples s, t of distinct elements from B there is an 
a £ £f such that as — t, where the action of a on tuples is componentwise, 
i.e., a(si, ...,Sfe) = (ctsi, . . . , asfe). We say that if is transitive if it is 1- 
transitive. 

• k-set transitive if for any two sets S, T C B of cardinality k there is an a G §f 
such that aS = {as \ s € S} = T. 

It is easy to see that a 2-set transitive group is also transitive. We prove the 
contraposition: assume that 03 has more than one orbit. There must be an orbit O 
with two distinct elements Ci,c 2 . Let c 3 be an element not from O. Then there is 
no automorphism that maps {01,02} to {01,03}, and hence 03 is not 2-set transitive. 
More generally, it holds that the number of orbits of n-subsets is a non-decreasing 
sequence [65] . 
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A congruence of Sf is an equivalence relation on B that is preserved by all per- 
mutations in <S . The equivalence classes of a congruence are also called blocks. A 
congruence is trivial if each block contains only one element (and non-trivial other- 
wise), and it is called proper if it is distinct from the equivalence relation that has 
only one block. When 23 is an w-categorical structure and 'S its oligomorphic au- 
tomorphism group, then the congruences of Sf are exactly the first-order definable 
equivalence relations in 23 (and so we apply the terminology that we have for congru- 
ences also to those equivalence relations) . A permutation group 'S is called primitive if 
is transitive and every proper congruence of §f is trivial, and imprimitive otherwise. 
Clearly, 2-transitive structures are always primitive. 

An orbital is an orbit of pairs, that is, a set of the form {(era, ab) \ a £ Sf} for 
a,b £ B. The trivial orbital is the orbital {(a,o) | o € B}. When O is an orbital, the 
orbital graph is the directed graph with vertex set B and edges O. The rank r(f$) of 
is the number of distinct orbitals of . 

For a sequence a of elements of B, the pointwise stabilizer ^ a of Sf is the set of 
all elements of Sf that fix a. For a subset A of B, the setwise stabilizer <& A of §? is the 
set of all elements a of Sf that fix A set-wise, that is, satisfy aA = A. 

3.3.4. Products. In this section we review the classical theory how to describe 
a permutation group in terms of transitive ones. The same idea can be used to 
construct new oligomorphic permutation groups from known ones. 

3.3.4.1. Group actions. It will be convenient to take a more general perspective 
on permutation groups (and this will again be used in Chapter [7]) . We now consider 
abstract groups, that is, algebraic structures G over a set G of group elements, with 
a function symbol for multiplication of group elements, a function for the inverse of a 
group element, and the constant for the identity. The link to permutation groups is 
given by the concept of an action of such a group on a set, which is described below. 

Let Sym(X) be the abstract group whose domain is the set of all permutations 
of X, and where composition is defined as composition of permutations, the inverse 
of an element g of Sym(A) is the inverse of g as a permutation of X, and the identity 
is the identity permutation. 

Definition 3.3.10. A (left) group action of an (abstract) group G on a set X 
is a binary function ■ : G x X — > X which satisfies that (gh) ■ x — g ■ (h ■ x) for all 
j,/ieG and i£S, and e • x = x for every x £ X . The action is faithful if for any 
two distinct g,h € G there exists an x £ X such that g ■ x =/= h ■ x. 

Equivalently, a group action of G on a set X can be seen as a homomorphism 
from G into Sym(A), and a faithful group action is an isomorphism between G 
and a subgroup of Sym(X). Clearly, to every action of G on X we can associate a 
permutation group as considered before, namely the image of the action in Sym(JC). 
Conversely, to every permutation group G on a set X we can associate an abstract 
group G whose domain is G, where composition and inverse are defined in the obvious 
way, and which acts on X faithfully by g ■ x = g{x). When 58 is a structure, we call 
G the abstract automorphism group of 23 if there is an action of G on B such that 
the image of G under this action is the automorphism group of 03. 

When x £ X, the orbit of x with respect to an action of G on X is the set 
{g-x | g £ G}. Hence, an orbit ofk-tuples in the corresponding permutation group on 
X is nothing but an orbit of the action of G on X k that is defined componentwise, that 
is, g maps (x\, . . . , xjA to (gx%, . . . , gxy). In this way we can also use other terminology 
introduced for permutation groups (such a transitivity, congruences, primitivity, etc.) 
for group actions. 
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The product of a sequence of groups (Gi) ie j is the product of this sequence as 
defined in general in Chapter [2j note that the product is again a group. Products 
appear in several ways when studying permutation groups; the first is when we want 
to describe the relation between a permutation group and its 'transitive constituents', 
described in the following. 

3.3.4.2. The intransitive action of a group product. When G acts on a set X and 
S C X is an orbit with respect to this action, then G naturally acts transitively on 
S by restriction; we call the corresponding group H the group induced by S, or a 
transitive constituent. 



Proposition 3.3.11 (see [65 ). Let G be a group acting on a set X, and let 
(Gj)j 6 j be the groups induced by the orbits of G on X . Then G is isomorphic to 
a subgroup of\\ ieI Gj, and there are surjective homomorphisms from G to G,, for 
each i. 

We can use the same idea to construct new oligomorphic permutation groups 
from known ones. 

Definition 3.3.12. Let Gi and G2 be groups acting on disjoint countable sets X 
and Y , respectively. Then the action of Gi x G2 on XL)Y defined by (<jfi, g^)- z — g±z 
if z £ X and giz if ' y £ Y is called the natural intransitive action of Gi x G2 on 
XUY. ' 

Note that when Gi and G2 act oligomorphically on X and Y, respectively, then 
the natural intransitive action of Gi x G2 is also oligomorphic. This can be seen from 



the theorem of Ryll-Nardzewski (Theorem 3.1.4): when Fx(n) is the number of orbits 



of the componentwise action of Gi on X n , and ^(n) is the number of orbits of the 
componentwise action of G2 on Y , then the number of orbits of the componentwise 
of Gi xGjonlUY is Ylo<i< n ^lM^M" — i), and hence finite for all n. 

When Gi and G2 are the automorphism groups of w-categorical relational struc- 
tures 21 and 03 with disjoint domains A and B, respectively, then the image of the 
natural intransitive action on AuB (as a homomorphism from Gi x G2 to Sym(AuB)) 
can also be described as the automorphism group of a relational structure £: we can 
take for £ the disjoint union of 21 and 03 (defined in Section [TTT] ) , expanded by a unary 
predicate that contains exactly the elements of A. Since reducts of w-categorical struc- 
tures are again w-categorical, this shows in particular that the disjoint union of two 
w-categorical structures is again w-categorical. 

3.3.4.3. The product action. When Gi is a group acting on a set X, and G2 a 
group acting on a set Y, there is another important natural action of G := Gi x 
G 2 besides the intransitive natural action of G, which is called the product action 
of G. In this action, G acts on X x Y by (31,32) ■ (x,y) — {g\x,g\y). If the 
actions of Gi and G2 are transitive, then the product action is clearly transitive, too. 
We claim that when the actions of Gi and G2 are oligomorphic, then the product 
action is also oligomorphic. Let F\(n) and ^(rz.) then the number of orbits of the 
componentwise action of G\ on X n and Y n , respectively. Then the number of orbits 
of the componentwise action of G on X x Y is F\(n)F2(n), and in particular finite, 
which proves the claim. 

When Gi and G2 are the automorphism groups of w-categorical structures 21 and 
03, then the image of the product action of G in Sym(^4 x B) is the automorphism 
group of the following structure, which we call the full product structure of 21 and 03, 
and denote by 21 M 03. Let a be the signature of 21, and t be the signature of 03; 
we assume that a and t are disjoint, otherwise we rename the relations so that the 
assumption is satisfied. For each fc-ary R £ a, the structure 21K103 contains the relation 
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{((ai,6i), . . . , (a k ,b k )) | (eti, . . . ,a k ) G i? a ,6i, . . . , 6 fc G P}, and for each fc-ary JJer, 
it contains the relation {((oi, 61), ... , (a*,, &&)) | (61, . . . , 6fc) € i? 9 *, Oi, • • • , Oft G ^4}- 
Finally, we also add the relations Pi = {((oi, 61), ((X2, 62)) I &i = 02} and P2 = 
{((01,60,(02,62)) I 61 = 6 2 } to 21 H 25. 

Proposition 3.3.13. The automorphism group 0} € := 21 Kl 23 is G : x G 2 in ite 
product action on A x P. 

Proof. Let /i be the product action of G = Gi x G2 on A x B, viewed as 
a homomorphism from G to Sym(A x B). Let (31,32) be an element of G. Then 
M(ffi;S2)) is the permutation (x,y) i-> (31X, 32Z/) of i x B, and this map preserves 
£: when ((a 1; 61), . . . , (a fc , 6&)) G P c , for R G a, then (ai,...,afc) G i? 21 , and so 
(ffiffli, • ■ ■ ,3i a fc) € P 21 - Therefore, ((3101,3261), . . . , (gia k ,g 2 b k )) € P C - The proof for 
the relation symbols P G r is analogous. 

Wc now show that conversely, every automorphism 3 of £ is in the image of h. Let 
Note that Pi and P2 are congruences of the automorphism group of £. Fix elements 
a G A, 6 G B. Let 31 be the permutation of A that maps a G A to the point a' 
such that h((a,bo)) — (a',b'). Similarly, let 32 be the permutation of B that maps 
6 G B to the point 6' such that h((ao,b)) — (a',b'). Since 3 preserves Pi,P2, the 
definition of 31 and 32 does not depend on the choice of a and 60. Moreover, 31 is 
from Gi, since 3 preserves the relations for the symbols from a. Similarly, 32 is from 
G 2 . Then 3' := h((g 1 ,g 2 )) equals 3, since g'((a,b) = (gia,g 2 b) = 3(0,6). Hence, 3 is 
a permutation of A x B that lies in the image of h. □ 

Note that Proposition |3.3. 13| becomes false in general when we omit the relations 
Pi and P2 in 2t 23. Consider for example the structure without structure 23 (that 
is, 23 has empty signature). Then the automorphism group of 23 23 is imprimitive, 
but without the relations Pi and P2, the structure is isomorphic to 23 and hence 
primitive. Also note that when 21 and 23 are ordered structures (and this will be a 
typical assumption in Chapter [8]), we could omit Pi and P2 in the definition of the 



full product without sacrificing Proposition 3.3.13 since Pi(x, y) is definable from the 



order < of 21 by the formula ->(x < y) A < x), and similarly P2 is definable from 
the order of 23. 

Finally we remark that (21K123)K1£ and 21K1(23K1£) have the same automorphism 
group (on the domain A x B x C). We explicitly define the d-fold full product as 
follows. 

Definition 3.3.14 (Full product of d structures). Let 23i, . . . , 23 d be structures 
with disjoint relational signatures Ti, ... ,T<j. We denote by 23iKl- • -K123rf the structure 
with domain B := B\ x • • ■ x Bd that contains for every i < d, and every m-ary 
R G U {=}) an m-ary relation defined by 

{((xi,...,x^),...,(x^,...,x^))GP m | (xl,...,x^)GP^}. 

7/23 :=23i = ••• = 23 k , then we first rename R G Tj into Ri so that the factors have 
pairwise disjoint signatures, and then write 23^1 for 23 1 M ■ ■ ■ Kl 23^. 

When 21 and 23 have the same signature r, then the automorphism group of 



the r-structure 21 x 23 (see Definition 3.3.4) contains the automorphism group of 



21 x 23, and hence 21 x 23 is w-categorical, by Theorem |3.1.4| As a consequence, the 
class of all cj-categorical structures forms a lattice with respect to the homomorphism 
order (where disjoint union is the join, and product the meet of two w-categorical 
structures). 
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3.4. Preservation Theorems 



Model-theoretic preservation theorems typically link definability in (a syntacti- 
cally restricted fragment of) a given logic with certain 'semantic' closure properties. 
For the syntactic restrictions on first-order formulas that we have introduced in Chap- 
ter [2] we have already made remarks about various types of mappings that automat- 
ically preserve the respective formulas. Surprisingly, very often these maps can be 
used to obtain an exact characterization of definability in the corresponding fragment 
of first-order logic. 

In this text, preservation theorems become relevant in two contexts. The first 
is that they can be used to give exact characterizations of existential, existential 
positive, and quantifier-free definability of relations over an w-categorical structure, 
in a similar way as we characterized first-order definability in Section |3.3| These 
characterizations require that we pass from automorphism groups to endomorphism 
monoids, and they turn out to be useful for the complexity analysis of CSPs. The 
various relevant connections are displayed in Figure |3.1| 



first-order definitions 
existential definitions 
positive definitions 
existential positive definitions 
quantifier-free definitions 



automorphisms 
self-embeddings 
surjective endomorphisms 
endomorphisms 
partial automorphisms 



Figure 3.1. Syntactically restricted definabilities and the corre- 
sponding preservation properties. 

The second context where we encounter model-theoretic preservation theorems is 
when giving syntactic descriptions of w-categorical theories themselves (rather than 
relations in w-categorical structures). For instance, we will see that for every oj- 
categorical structure 21 there exists a homomorphically equivalent w-categorical struc- 
ture 25 whose first-order theory is V3 + . 

3.4.1. Model-theoretic preservation theorems. When T is a first-order the- 
ory, we say that </> and if> are equivalent modulo T if T |= ip) (see Section 1.3). 
The following theorems are well-known and can be found in most model theory books. 



Theorem 3.4.1 (Los-Tarski; see e.g. Corollary in 5.4.5 of |114 ). Let T be a first- 
order theory. A first-order formula <j) is equivalent to an existential formula modulo 
T if and only if <f> is preserved by all embeddings between models ofT. 

Theorem 3.4.2 (Lyndon; see e.g. Corollary in 8.3.5 of [114] ). Let T be a first- 
order theory. A first-order formula 4> is equivalent to a positive formula modulo T if 
and only if <p is preserved by all surjective homomorphisms between models of T . 

Note that here the assumption that _L is always part of first-order logic becomes 
relevant: the first-order formula 3x. x 7^ x is preserved by all homomorphisms between 
models of T, but without _L it might not be equivalent to a positive formula modulo 
T (for instance when T is the empty theory). 

Theorem 3.4.3 (Homomorphism Preservation Theorem; see e.g. Exercise 2 in 
Section 5.5 of 114) ). Let T be a first-order theory. A first- order formula <fi is equiv- 
alent to an existential positive formula modulo T if and only if 4> is preserved by all 
homomorphisms between models of T . 



Theorem 3.4.4 (Chang-Los-Suszko Theorem; Theorem 5.4.9 in 114 and re- 
marks after the proof). Let T be a first-order t -theory. 
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A set of first-order T-formulas <I> is equivalent to a set of ^B-formulas \& 
modulo T if and only if $ is preserved in unions of chains {Ai) whenever 
IJAj and all the Ai are models ofT. 

A first- order T-formulas <j) is equivalent to a^B-formulas tp modulo T if and 
only if (f> is preserved in unions of chains (Ai) whenever (J Ai and all the Ai 
are models of T. 



Our next preservation theorem, Theorem 3.4.6 is a positive variant of the Chang- 
Los-Suszko preservation theorem, which we could not find in explicit form in the 
literature. Its proof can be derived from the proof of the Chang-Los-Suszko theorem 



given in 114 by modification of a sequence of lemmata given there; since some of 



them require some care, we will present those modifications in full detail here. Besides 



the existential positive amalgamation theorem (Lemma 2.4.4), we need the following 
lemma. 

Lemma 3.4.5. Let T be a first-order theory, and let 21 be a model of the V3 + - 
consequences of T . Then 21 can be extended to a model 03 of T such that every 
existential positive formula that holds on a tuple a in 03 also holds on a in 21. 

Proof. Let 21' be an expansion of 21 by constants such that all elements of 21' are 
denoted by a constant symbol. It suffices to prove that T U diag(Sl') U diag v - (21') has 
a model 03. Suppose for contradiction that it were inconsistent; then by compactness, 
there exists a finite subset U of diag v - (21') U diag(2l') such that TU U is inconsistent. 
Let <f> be the conjunction over U where all new constant symbols are existentially 
quantified. Then T U {(f)} is inconsistent as well. But -xf) is equivalent to a V3 + 
formula, and a consequence of T. Hence, 21 |= -«f>, a contradiction. □ 

The following is a positive version of the Chang-Los-Suszko theorem (Theo- 



rem 3.4.4) 



Theorem 3.4.6. LetT be a first- order r -theory, and $ a set of T-formulas. Then 
the following are equivalent. 

(1) $ is modulo T equivalent to a set of 'V3 + -formulas \& . 

(2) $ is preserved in direct limits of sequences of models of T ; 

(3) $ is preserved in direct limits of countable sequences of models of T . 



Proof. The implication from (1) to (2) is Proposition 3.5.7 The implication 
from (2) to (3) is trivial. For the implication from (3) to (1), assume that (j> is 
preserved by direct limits of sequences (21.;) as in the statement of the proposition. 



We can assume that <& is a set of sentences (by adding constants, Lemma 2.3.2). Let 
<I> be the set of all V3 + -sentences that are consequences of T U <&. We first show that 
TU ^ implies <j>. It suffices to show that every model of TU \I/ is elementary equivalent 
to a direct limit of a sequence (Q3i)i< w of models of T U $ where there are coherent 
homomorphisms fij : 25^ — > *Bj with fjk o = f ik for all i < j < k. 

To construct this sequence, we define an elementary chain of models (2li)j< a; of 
TUf such that there are 

• homomorphisms fi : 21^ — > 03 j, with 03^ |= T U such that for every 
tuple <Zj of elements from 2li and every existential positive formula 0, if 
23, h= 0(/i(5i)), then 21, |= <9(a t ), and 

• homomorphisms gi : 03^ — > 2ti+i, such that gi o is the identity on 21;. 
Let 2l be a countable model of T U To construct the rest of the sequence, 

suppose that 21; has been chosen. Since 21q is an elementary substructure of 2tj, in 



particular all the V3 + -consequences of TU$ hold in 2lj. By Lemma 3.4.5 the structure 



21; can be extended to a model 03; of T U $ such that every ep-sentence that holds 
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in (93j,<ij) also holds in (2tj,ejj). By Lemma 2.4.4 there are an elementary extension 
2li+i of 2ti and a homomorphism gi : 03,; — » 2ti+i such that gi o /j is the identity on 
2lj. Then £ := lj i<w 2ti equals limi <w 05;, and by the Tarski-Vaught elementary chain 



theorem (Theorem 3.5.3) 2lo is an elementary substructure of £. So £ is a model of 
T, and the direct limit of models 03^ of T U and hence £ \= <f). This shows that 
TUf implies $. □ 

By compactness one can show that when $ is finite, then the formula ^ from 



item (1) in Theorem 3.4.6 above can be chosen to be finite as well. 



3.4.2. Endomorphisms and self-embeddings. We apply the model-theoretic 
preservation theorems from the previous section to characterize existential, positive, 
and existential positive definability of relations in aj-categorical structures. 

For finite structures and existential positive definability the corresponding preser- 
vation theorem has already been noted by Krasner 132| (for finite structures, self- 



(2) 
(3) 



embeddings are necessarily automorphisms, and existential definability is the same as 
first-order definability) . 

Theorem 3.4.7 (from [22] and 36[|46| ). Let 05 be an uj -categorical structure with 
base set B, and R C B k be a relation. 

(1) R has an existential definition in 05 if and only if R is preserved by all 
self-embeddings o/05. 

A non-empty relation R has a positive definition in 03 if and only if R is 
preserved by all surjective endomorphisms o/03. 

A non-empty relation R has an existential positive definition in 05 if and 
only if R is preserved by all endomorphisms of 03 . 

Proof. We have already remarked in Chapter [2] that existential positive for- 
mulas are preserved by endomorphisms, and existential formulas are preserved by 
self-embeddings of 03. 

For the other direction, note that the endomorphisms and self-embeddings of 
03 contain the automorphisms of 03, and hence Theorem 3.1.4 shows that R has a 
first-order definition in 03. Suppose for contradiction that R were preserved by 
all endomorphisms of 03 but has no existential positive definition in 03. We use the 
homomorphism preservation theorem. Since by assumption <p is not equivalent to 
an existential positive formula in 03, there are models 03i and 052 of the first-order 
theory of 03 and a homomorphism h from 03 1 to 052 that violates <f>. By the theorem of 
Lowenheim-Skolem (Theorem 2.3. 3[ ) the first-order theory of the two-sorted structure 
(03i,03 2 ;/i) has a countable model ($8' x , 93 2 ; h'). Since both OS^ and 03 2 must be 
countably infinite, and because 03 is w-categorical, we have that 03^ and 05' 2 are 
isomorphic to 03, and h' can be seen as an endomorphism of 03 that violates <p; a 
contradiction. 

The argument for existential definitions and positive definitions is similar, but 
instead of the homomorphism preservation theorem we use the theorem of Los-Tarski 
(Theorem 3.4.11 and Lyndon's theorem (Theorem 3.4.2). □ 



We now present a Galois connection for existential positive definability and trans- 
formation monoids, similar to the Galois connection for first-order definability and 
permutation groups. For a structure 05, we denote the set of relations with an existen- 
tial positive definition in 05 by (03) cp . Similarly as in Section 3.3 we say that a set of 
operations & C (B — > B) is (locally) closed if it contains every operation / : B — > B 
such that for every finite subset A of B there exists a g £ ^ such that /(a) = g(a) 
for all a e A. The closure of & is the smallest locally closed set of operations that 
contains & '. 
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When & is a transformation monoid, then (J£") denotes the smallest locally closed 
transformation monoid that contains & '. The set of endomorphisms of a relational 
structure 03 (or the set of operations from B — > B that preserve a set of relations 
TZ over the domain B) is denoted by End(03) (or by End(7£), respectively). The 
following can be shown in a similarly straightforward way as Proposition |3.3.2| 

Proposition 3.4.8. For every & C [B — > B), the following are equivalent. 

(1) ^ is the transformation monoid of a relational structure; 

(2) & is a locally closed monoid. 

The proof of the following statement is similar to the proof of Proposition |3.3.6| 

Proposition 3.4.9. Let & C (B — > B) be a transformation monoid. Then 
g: B — >• B is in the closure of & if and only if g preserves all relations in Inv(^"). 
In symbols, 

End(Inv(J*")) = (=T) . 

Theorem |3.4.7| now implies the following analog to Corollary |3.3.9| 

Corollary 3.4.10. Let £ be an uj- categorical structure. Then the lattice of locally 
closed transformation monoids that contain Aut(£) is anti-isomorphic to the lattice 
of sets of the form (03) cp where 03 is first-order definable in £. 

To illustrate the use of this Galois connection, we present a simple and typical 
application; see Corollary |5. 3. 8| to see consequences of this observation for constraint 
satisfaction. 

Lemma 3.4.11. Let 03 be such that Aut(03) is 2-set transitive. If 03 has a non- 
injective endomorphism f , then 03 also has a constant endomorphism. 

Proof. Let / be an endomorphism of 03 such that f(b) = f(b') for two distinct 
values b, b' 6 B. Let foi , 62 , - - - be an enumeration of B. We construct an infinite 
sequence of endomorphisms ei,e2,..., where ej is an endomorphism that maps all 
of the values 61, . . . , &j to b\. This suffices, since then by local closure the mapping 
defined by e(x) := bi for all x is an endomorphism of 03. 

For e\, we take the identity map, which clearly is an endomorphism with the 
desired properties. To define e; for i > 2, let a be an automorphism of 03 that 
maps {61, ei_i(6j)} to {b, b'}; such an automorphism exists because Aut(03) is 2-set 
transitive. Then the endomorphism /(aej_i(a;)) is constant on b\,. .. ,6^; recall that 
bi = ej_i(&i) = . . . = e,-_i(&i_i). Since 03 is 2-transitive, it is in particular transitive, 
and there is an automorphism j3 that maps f(b) to 61. Then a : x <— > /?/(aei_i(x)) is 
an endomorphism of 03 with the desired properties. □ 

3.4.3. Locally invertible self-embeddings. Let 21 and 03 be r-structures, let 
e be an embedding of 21 into 03, and let / be an embedding of 03 into 21. We say that 
e and / locally invert each other if 

• for every tuple a of elements of 21 there are fj £ Aut(03) and a € Aut(2l) 
such that af(j3e(a)) = a, and 

• for every tuple b of elements of 03 there are a £ Aut(2l) and j3 G Aut(03) 
such that /3e(af(b)) = b. 

We say that e is locally invertible if there exists a self-embedding / such that e and 
/ locally invert each other. 

We will show that locally invertible self-embeddings preserve first-order formulas. 
To do so, we need the following concept. Let 21 and 03 be r-structures. A back-and- 
forth system from 21 to 03 (our definition is taken from 114| ) is a non-empty set I of 



pairs (a, b) of tuples, with a from 21 and b from 03, such that the following hold. 
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(1) If (a, b) £ I then a and b have the same length and (21, a) satisfies the same 
atomic formulas as (03, 6). 

(2) (Going Forth.) For every pair (a, 6) £ I and every element c of 21 there is 
an element d of 25 such that the pair (ac, bd) £ I. 

(3) (Going Back.) For every pair (a, b) £ I and every element d of 25 there is 
an element c of 21 such that the pair (ac, bd) £ I. 

There is a back-and-forth system from 21 to 25 if and only if 21 and 25 are isomorphic 



(combination of Lemma 3.2.2 and Theorem 3.2.3 (b) in 114 ) 



Theorem 3.4.12. A relation R has a first-order definition in an uj- categorical 
structure 25 if and only if R is preserved by all locally invertible self- embedding s o/25. 

Proof. We are in the remarkable situation (in comparison to the other preser- 
vation theorems discussed here) that the "if direction of the statement is easy (it 
follows directly from the theorem of Ryll-Nardzewski, since automorphisms are locally 
inverted by their inverse), and that we only have to show the 'only if direction. 

Let e and / be self-embeddings of 25 that locally invert each other, and suppose 
that a is a tuple from 25 that satisfies a first-order formula <j>. We claim that e(a) 
satisfies 4> as well. It clearly suffices to show that the structures (25, a) and (25, e(a)) 
are isomorphic. We claim that the set 

I := {(u,v) | there are 7,(5 £ Aut(25) 

so that 8e^(u) = v} 

is a back-and-forth system from (25, a) to (25,e(a)). 

The set / is non-empty, since (a, e(a)) £ I (we have 7 = 5 = id in the definition 
of /). It is obvious that I satisfies item (1) in the definition of back-and-forth systems 
since all involved operations are embeddings. Now, let (u, v) be from /. By definition 
of /, there are 7 £ Aut(25) and 8 £ Aut(25) so that 8e(^u) = v. For going forth, let 
c be an arbitrary element of 25. Let d be 8e(jc). The clearly (uc, vd) £ I. 

For going back, let d be an arbitrary element of 25. Since e is locally inverted by 
/, there exist a, (3 £ Aut(25) such that a/(/3e(7u)) = ju. Since e(ju) — S^ 1 ^, this is 
the same as saying that af((3S~ 1; D) — ju, and by multiplication with a" 1 we note 

ftfS- 1 *) = a-'ju . (7) 

We now set c to 7™ 1 a/(/3<5~ 1 d), claiming that (uc, vd) £ I, which completes the proof. 

To show the claim, we have to find 7', §' £ Aut(25) such that 8' e^' (uc)) = vd. 
Let p be the tuple fiS^ivd). By the second item in the definition of local inversion, 
there are a',/3' G Aut(25) such that (3'e(a'f(p)) = p. 

Choose 7' = a'aT 1 -) and 6' = (5/3" 1 /?'. Then 

5'e(i(uc)) = 6(3- 1 f3 , e(a , a- 1 j(uc)) 

= dp- 1 p'e(a'(a- 1 "fu, a -x 7c)) 

= 5p- l P'<ot'(f(p8- 1 v)J{p5- l d))) (see @) 

= 5p- 1 P'e(a'f(p5- 1 (vd))) 
= 8p- l p8- l (vd) 
= vd , 

and so (uc, vd) £ I. □ 

Theorem |3 . 4 . 1 2 1 will be used in Section [3~6} and later also in Chapter [T0| as a tool 
for proving that all automorphisms of certain structures 25 are locally generated by 
the self-embeddings of 25. We note the following consequence of Theorem 3.4. 12| 
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Corollary 3.4.13. An endomorphism e of an uj- categorical structure is locally 
invertible if and only if e is locally generated by the automorphisms ofT. 

Proof. If e is locally generated by the automorphisms, then e is clearly locally 
invertible. The converse follows from Theorem |3.4.12| in combination with Proposi- 
tion m □ 

3.4.4. Partial Automorphisms. Recall that a formula is called quantifier-free 
if it can be constructed from atomic formulas by usage of Boolean connectives only. 
Also quantifier-free definability can be characterized by a model-theoretic preservation 
theorem; in this case, this is very easy to prove. 

Proposition 3.4.14. Let T be a first-order theory. A first-order formula <f> is 
equivalent to a quantifier-free formula modulo T if and only if (f> is preserved by partial 
isomorphisms between models ofT. 

Proof. It is clear that quantifier-free formulas are preserved by partial isomor- 
phisms between models of T. For the converse, let 4> be preserved by all partial 
isomorphisms between models of T. Let 'J be the (finite) set of all quantifier-free 
formulas ip such that T \= \/x(cj)(x) ip{x)). It suffices to prove that ^ implies <f>. 
Let 21 be a model of T and a a tuple from 21 such that a satisfies \& in 21. Let 23 be 
a model of T and b a tuple from 23 such that b satisfies in 23 (if no such 25 exists, 
the statement of the proposition is trivial). Since a and b satisfy the same atomic 
formulas, the mapping that sends a to b is a partial isomorphism. Since tfj is preserved 
by partial isomorphisms, b satisfies ip in 21 and we are done. □ 

The following can be be derived from the previous proposition. 

Proposition 3.4.15. Let 25 be an co- categorical structure. Then a relation R 
has a quantifier-free definition in 23 if and only if R is preserved by all partial au- 
tomorphisms of*B, i.e., preserved by isomorphisms between induced substructures of 
23. 

Proof. If a relation R is preserved by all partial endomorphisms of 23, then it 
is in particular preserved by all endomorphisms of 23 . By Theorem |3.4.7 R has an 



existential positive definition in 23. We can therefore use Proposition 3.4. 14| in the 



same way as we used model-theoretic preservation theorems to prove Theorem |3. 4. 7| 
to conclude the argument. □ 



3.5. Existential Positive Completion 

It might be that the same CSP can be formulated with different templates. Con- 
sider for example the random graph (V; E) , which has exactly the same CSP as the 
template (N; {{x 7 y) \ x ^ y}). Recall that two structures 23, £ have the same CSP if 
and only if they have the same existential positive theory T, or equivalently, if they 
have the same universal negative theory S. Moreover, any model of T U S has the 
same CSP as 23 and <£. The topic of this section is how to produce a model 23 of 
T(J S such that 23 has many good properties for studying CSP(23). Good candidates 
for such models 23 are existential-positively closed models, which will be introduced 
here. Much of the material presented here is analogous to the classical facts about 
existential completion, which we briefly review in Section |3.5.1| We then discuss ex- 
istential positive completion in Section |3.5.2| The results in this section have been 



published in 33 . 
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3.5.1. Existential Completion. Let T be a first-order theory. A model 21 of 
T is existentially closed for T (we also say that 21 is an existentially closed model of 
T) if 21 f= 4>(a) for any embedding e from 21 into another model 03 of T, any tuple a 
from A, and any primitive formula <f> with 58 |= </>(e(a)). 

To construct existentially closed models of T, we use unions of elementary chains. 
Let (2li)i<« be a sequence of T-structures for a relational signature t. Then (2L)i< K 
is called a chain if 21^ C 2lj for all i < j < k. A chain is called an elementary chain 
when for all i,j < k, the extension 2tj of 21^ is elementary. 

Definition 3.5.1. The union of the chain (2lj)i< 7 is a r-structure 03 defined as 
follows. The domain of 03 is Ui< 7 ^; an d f or each relation symbol R £ t we put 
a £ i? B if a £ /or some (or all) Qli containing a. 

We say that a formula <fi is preserved in chains ( of models of T) if for all chains 
(2li)i<« of r-structures (where all the 2lj and 21 := [J i<K ^i are models of T), and 
every finite n-tuple a of elements of A we have 21 |= <j)(a) whenever 2lj |= <fi(a) for 
every i < k. A first-order theory T is inductive if the union of every chain of models 
of T is also a model of T. 



Proposition 3.5.2 (Theorem 2.4.4 in 114| ). Every V3- formula is preserved 
unions of chains. 

In particular, when T is a V3-theory, then T is inductive. 



Theorem 3.5.3 (Tarski-Vaught; Theorem 2.5.2 in 114 ). Let (2lj),< K be an ele- 
mentary chain of T-structures. Then {J i<K 2L is an elementary extension of each 2lj, 
i < k. 

The following lemma implies that if T is inductive, then it has an existentially 



closed model. For a proof, see 114 , or the proof of Lemma 3.5.8 below, which is 
very similar. 



Lemma 3.5.4 (Corollary 7.2.2 in 114 ). Let T be an inductive theory and let k 



be an infinite cardinal. Then any model 21 of T of cardinality at most k embeds into 
an existentially closed model 03 of T of cardinality at most k. 

3.5.2. Existential positive completion. Again, let T be a first-order theory. 

Definition 3.5.5. A model Ql ofT is existential-positively closed for T (or short 
an epc model oiT) if% \= 4>{a) for any homomorphism h from 21 into another model 
03 ofT, any tuple a from A, and any existential positive formula <p with 03 |= <j){h(a)). 

Note that we can equivalently replace 'existential positive' by 'primitive positive' 
in the previous definition. To show the existence of epc models we apply the direct 
limit construction. Direct limits can be seen as a positive variant of the notion of a 
union of chains; we essentially replace embeddings in chains by homomorphisms. 

Let r be a relational signature, and let 2lo, 2ti, • • • be a sequence of r-structures 
such that there are homomorphisms : 21, — > 2L, . Those homomorphisms are called 
coherent if f jk o f, LJ = f ik for every i < j < k. 

Definition 3.5.6. Let 2to,2li,... be a sequence of r-structures with coherent 
homomorphisms : 2ti — > %lj . Then the direct limit lim^^ 2L is the r-structure 
21 defined as follows. The domain A of 21 comprises the equivalence classes of the 
equivalence relation ~ defined on U i<UJ Ai by setting Xi ~ Xj for Xi £ A^Xj £ Aj iff 
there is a k such that fik(%i) = fjk{ x j)- Let <?j : Ai — > A be the function that maps 
a £ Ai to the equivalence class of a in A. For R £ r and a tuple a over A, define 
21 |= R(a) iff there is a k and a tuple b over A% such that 2t/c |= R(b) and a = gfc(o). 
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Note that the definition of \im i<tLl %i also depends on the coherent family f t j, but 
this is left implicit and will be clear from the context. Also note that gi defines a 
homomorphism from 21; to 21; this function is called the limit homomorphism from 
21; to the direct limit 21. Note that <?; = gj o /y for alH < j < u. We have seen that 
unions of chains preserve V3-formulas; the analogous statement for direct limits is as 
follows. We say that a first-order formula <t>(x\, ...,x n ) is preserved in direct limits 
(of models ofT) if for all sequences (2lj)j< re (where all the 21, and 21 := lim i<K 2l; are 
models of T) , and every n-tuple a of elements of A we have 21 \= <fr(a) whenever there 
is for every i < k an n-tuples a? where the j-th entry is a representative of the j-ih 



entry of a, and 21; |= 4>(a l ). The following is Theorem 2.4.6 in 113 ; since it is given 
there without proof, we present it here for completeness. 



Proposition 3.5.7 (Theorem 2.4.6 in 113 ). Every V3 + -formula is preserved in 
direct limits of models ofT. 

Proof. Let (21,),< K be a sequence of models ofT with coherent homomorphisms 
hij : 2lj — > 2tj, for i,j < k, such that 21 := lim; <K 2L is a model of T. Let gi be 
the limit homomorphism from 21, to 21. We have to show that if a is a tuple of 
elements of 21 such that for all i < K there is a, with gi(cLi) = a and 21; |= 0(5;), 
then 2t |= <fr(a). Since <j> is V3 + , we can assume that 4>(x) is of the form Vy. 4>' (x,y) 
where </>' is a disjunction of negated atomic formulas and existential positive formulas. 
Suppose that 4>'(a, b) is false in 21 for some tuple b of elements of 21. Every disjunct 
ip of cj)'(a,b) is false in 21. Then there exists an i < n such that all entries of b have 
representatives in 2tj, and the negated atomic disjuncts of 4>' are already false in 21^, 
by definition of direct limits. Let bi be a tuple of elements of 21; where the j-th entry is 
a representative of the j-th entry in b. Since 21; |= 0(a.;), there must exist a disjunct 
ip of (j)' such that tp{a,i,bi) holds in 21;. The limit homomorphism gi maps (a;,6j) 
to (a, b) and is a homomorphism from 21; to 21, and therefore preserves existential 
positive formulas, contradicting the assumption that <p'(a,b) is false in 21. □ 



Lemma 3.5.8 (from 33 ; also see 18 ). Let T be a V3 + theory and let k be 



an infinite cardinal. Then any model of % of T of cardinality at most n admits a 
homomorphism to an epc model 03 of T of cardinality at most k. 

Proof. Set 03o := 21. Having constructed 03^ of cardinality at most k, for i < u, 
let {(</>«) 5o) | a < k} be an enumeration of all pairs (</>, a) where <f> is existential 
positive with free variables x\, . . . ,x n , and a is an rt-tuple from Bi. We construct a 
sequence (55f )o<«<k of models of T of cardinality at most k and a coherent sequence 
(ft' a )o<n<a< K where /f ' a is a homomorphism from 03 f to 03 f , as follows. 

Set 03° = 03i_i. Now let a = /3+1 < k be a successor ordinal. Let bp be the image 
of ap in 03f under /° ,/3 . If there is a model £ of T and a homomorphism /i : 03^ — > £ 
such that €. \= </>p{h(bp)), then by the theorem of Lowenheim-Skolem there is also a 
model £' of cardinality at most k of T and a homomorphism hi : 03 f — > £' such that 
C |= 4>p(h'(fip)). Set 03? = C and /f' Q = ft' o /f' Q for all /i < a. Otherwise, if 
there is no such model £, we set 03" = 03 f and /f ' a = id (the identity mapping) and 
ft'° = /j A1 ' 9 - Finally, for limit ordinals a < k, set 03 f = lim M<Q 03 f and let /f be 
the corresponding limit homomorphism from 03^ to Sf. 

Let 03; be lim Q<K 03° and let g; : 58;_i —¥ 03^ be the limit homomorphism mapping 
03i_i = 03 i to its equivalence class in 03;. In the natural way, the gi give raise to a 



coherent sequence of homomorphisms, and by Proposition 3.5.7 03 := limj <u) 03; is a 
model of T; let hi : 03^ — > 03 for i <w be the corresponding limit homomorphisms. 

The structure 03 is epc in T. To verify this, let g be a homomorphism from 03 to 
a model £ of T, and suppose that there is a tuple b over £> and an existential positive 
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formula cj> such that £ |= <f>(g(b)). Then there is an i < lu and an a £ Bi such that 
hi(a) — b. Then go hi is a homomorphism from 03^ to £, and by construction we have 
that *Bi+i |= 4>{gi{a)). Note that hi + iog t = hi. Thus, since hi+\ preserves existential 
positive formulas, we also have that 03 |= </>(&), which is what we had to show. □ 

For an equivalent characterization of existentially closed models in terms of max- 



imal pp-types (Proposition 3.5.10), we need the following lemma, a close relative of 
Theorem 10.3.1 in |113| . 

Lemma 3.5.9. Lei 21 and 03 be r- structures, where^B is pp-\A\-saturated. Suppose 
that u < \A\ and that f is a mapping from {a a \ a < /i} C A to B such that all pp- 
(tU{c q I a < pi}) -sentences true on (21, (a a ) a<fJl ) are true on (03, (f(a a )) a<u ) . Then 
f can be extended to a homomorphism from 21 to 03 . 

Proof. If 03 is finite, then 03 is pp-|A|-saturated no matter what cardinality 
21 has. Suppose u < \A\. Let (a^,) a <|^| well-order A such that {a' a \ a < fi} = 
{a a | a < /u,} (there is the implicit and harmless assumption that (a a ) a<u contains no 
repetitions). Set (b a ) a<n := (f(a a )) a<u . 

We will construct by transfinitc induction on f3 (up to \A\) a sequence (b a ) a< j3 
such that we maintain the inductive hypothesis 

(*) all pp-(r U {c a | a < /3})-sentences true on (21; (a' a ) a< fj) are true on (*B; {b a ) a< p). 

• (Base Case.) f3 = pL. Follows from the hypothesis of the lemma. 

• (Inductive Step. Limit ordinals.) /? = A. Property (*) holds, since a sentence 
can only mention a finite collection of constants, whose indices must all be 
less than some 7 < A. 

• (Inductive Step. Successor ordinals.) (3 = 7+ < Set 

S := {(j>(x) I (f> is a pp-(r U {c a \ a < 7})-formula such that 

(21; {a' a ) a<1 ) • 

By (*), for every </> g E, (S; (b a ) a<1 ) |= 3x.(j)(x). By compactness, since E 
is closed under conjunction, we have that E is a pp-l-type of (*B; (b a ) a<1 ). 
Then E is realized by some element 6 7 £ B because 03 is pp-|A|-saturated. 
By construction we maintain that all pp-(r U {c a \ a < 7 + })-sentences true 
on (21; (a' Q ) Q<7 +) are true on (03; (b a ) a<J +). 
The result follows by reading / as the function that maps a' a to b a for all a < \A\. □ 

PROPOSITION 3.5.10. Let T be a theory, and let 21 be a model of T . Then 21 is 
epc for T if and only if every complete pp-n-type of ^21 is a maximal pp-type of T . 

Proof. (Forwards.) Suppose p{x\, . . . , x n ) is an pp-n-type, realized in 21 by the 
tuple (01, . . . , a n ). Let c±, . . . , c„ be new constant symbols that denote a±,. . . ,a n in 
21. Let 4>{x\, . . . , x n ) be a primitive positive formula such that T U p{c\, . . . , c„) U 
{4>(ci, . . . ,c„)} has a model (<£; c\, . . . , c n ). Now, let (£ sat ; ci, . . . , c„) be a n \A\- 
saturated model of Th(£; c\,..., c n ); such a model always exists by Theorem 2.5. 1| 



Clearly (£ sa t; c\, . . . , c„) is pp-|A|-saturated, and all primitive positive formulas true 



on (21; c\, . . . , c„) are true on (<£ sa t; ci, . . . , c n ). By Lemma 3.5.9 there is a homo- 
morphism h from (21; c±, . . . , c„) to (£ sa t; c\, . . . , c n ). Now, since <p(ci, . . . , c„) holds 
on £ sat and 21 is epc for T, we find that 21 |= <p(c\, . . . , c„), and conclude that p is a 
maximal pp-type of T. 

(Backwards.) Take 03|=T, ft,: 21 — >03a homomorphism, a a tuple of elements 
of 21, and 4>(x±, . . . , x n ) a primitive positive formula such that 03 |= tfi(h(a)). Let p be 
the pp-type of a in 2t. Since 03 is a model of T and h preserves all primitive positive 
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formulas, it follows that TUpU {(/>} is satisfiable. By maximality of p, we have that 
4> G p, and therefore 21 |= <fi(a). □ 

We close this section with an observation that will be needed later on. 



Lemma 3.5.11. 
limits. 



The class of all epc models of a theory T is closed under direct 



Proof. Suppose that 21 = limA< K 2lA for a sequence (21a)a<k of epc models of 
T, a is a tuple from 21, <f> an existential positive formula, and h is a homomorphism 
from 21 into another model of T such that 58 |= 4>(h(a)). Then there exists a A < k 
such that a = g\{a!) for a! from 21a (where g\ is as in the definition of direct limits). 
Note that h o g x is a homomorphism from 21a to 58, and since 21a is an epc model 
of T, 21a H 4>{o!)- Since g\ preserves existential positive formulas, we thus also have 
that 21 |= 4>{a). □ 

3.6. Quantifier-elimination, Model-completeness, Cores 

This section is concerned with structures 58 where various forms of syntactic 
restrictions of first-order logic have equal expressive power. In particular, we consider 
the situation that in 58 every first-order formula is equivalent to 

• a quantifier-free formula (Secti on |3. 6.1 ), 

• an existential formula (Section |3. 6.2 ), 

• an existential positive formula (Sections |3.6.3 and 3.6.4). 

For w-categorical structures, such a definability collaps translates nicely into a prop- 
erty of the operations that preserve 58, using the preservation theorems from Sec- 
All these collapse results will be 



tion 



3.4 A survey picture is given in Figure 3.2 



useful when studying the complexity of CSPs. For example, these results clarify when 
the so-called constraint entailment problem for 21 can be reduced to the constraint 



satisfaction problem for 21 (see Section 3.6.4). 

We also develop a theory that can be viewed as a positive variant of the classical 



theory of model-completeness and model companions (Section 3.6.4 and 3.6.5). This 
allows us to clarify the question which CSPs can be formulated with an w-categorical 



template (Section 3.6.6) 



3.6.1. Quantifier-elimination. We say that a r-structure 21 admits quantifier 
elimination if for every first-order r-formula there exists an equivalent quantifier-free 
r-formula. 

In this context, our assumption that we allow 1 as a first-order formula (de- 
noting the empty 0-ary relation) becomes relevant; Hodges 113 does not make this 
assumption, and therefore has to distinguish between quantifier- elimination and what 
he calls quantifier- elimination for non-sentences. We will later often make use of the 
following fact. 

Lemma 3.6.1 (Statement 2.22 in |65| ). An u- categorical structure 58 admits quan- 
tifier elimination if and only if it is homogeneous. 



Proof. By Theorem |3.1.4| the orbit of a fc-tuple of elements of 58 is first- 
order definable. Suppose that 58 has quantifier-elimination. Then two fc-tuples 
a = (ai, . . . , eifc) and b = {pi, ... , bk) are in the same orbit if and only if the mapping 
that sends <Zj to bi 7 for 1 < i < n, is an isomorphism between the structures induced 
by {ai, . . . , dfc} and by . . . , bk}- This proves homogeneity, because such a map- 
ping can then be extended to any automorphism of 58 demonstrating that a and b 
are in the same orbit. All implications in this argument can be reversed, which shows 
the claim. □ 
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Figure 3.2. Various forms of definability (left side), ordered by rel- 
ative strength, and the corresponding class of operations (right side). 
The labels on the arrows indicate the condition on the structure 
when the corresponding two forms of definability coincide (left side), 
and correspondingly when one set of operations locally generates the 
other (right side). 



3.6.2. Model-Completeness. The purpose of this section is to recall classical 
results about model-completeness; they inspired the new results of the next section 
about model-complete cores. A theory T is model- complete if every embedding be- 
tween models of T is elementary, i.e., preserves all first-order formulas. There are 
several equivalent characterizations of model-completeness, stated in Theorem |3.6.2| 
below. 



Theorem 3.6.2 (Theorem 7.3.1 in 114| ). Let T be a theory. Then the following 
are equivalent. 

(1) T is model- complete. 

(2) Every model of T is an existentially complete model of T . 

(3) Every first- order formula is equivalent to an existential formula modulo T. 



For the proof, we refer to 114 ; but note that the theorem has a positive variant 
(Theorem 3.6.11 below) with an analogous proof that will be presented in full length. 

Example 3.6.3. The structure (Qq\ <), where Qq denotes the non-negative ra- 
tional numbers, is not model-complete, because the self-embedding x t— > x + 1 of 
(Qo i <) °- oes not P reserve the formula cj>{x) — Vy (y > x 3z (x < z A z < y)) (which 
is satisfied only by 0). 

When 21 is not model-complete, we can sometimes find a model-complete structure 
23 that satisfies the same existential first-order sentences as 21. 

Definition 3.6.4. A theory U is a model companion of a theory T if 

• U is model-complete; 

• Every model of U embeds into a model of T; and 

• every model of T embeds into a model of U . 

Note that the last two conditions in this definition arc equivalent to saying that U 
and T imply exactly the same existential sentences (equivalently, the same universal 



sentences); the proof is analogous to the one of Proposition 2.4.5 
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If T has a model-companion U, then U is unique up to equivalence of theories. 
Theorem 3.6.5 (Theorem 7.3.6. in |114| ). For any two model- companions U±,U2 



of a theory T we have that U\ h U2 and U2 h U\ . 



The following theorem by Simmons 181 will not be used in this thesis; however, 



it has an existential positive version, Theorem 3.6.24 below, which has important 
consequences for the study of the CSP. Recall the joint embedding property, which 
has been defined for classes of structures in Section |3.2| a theory T has the joint 
embedding property (JEP) if for any two models 58i,582 of T there exists a model €. 
of T that embeds both 58 1 and 58 2 . 



Theorem 3.6.6 (from [181] ). Let T be a theory with the JEP. Then the following 
are equivalent. 

• T has an lo- categorical model companion. 

• For every n, T has finitely many maximal existential n-types. 



In particular, every oj -categorical theory has an lo -categorical model companion \1 74 



The consequence stated for w-categorical theories T at the end of Theorem |3.6.6| 



is an earlier result by Saracino 174 , and clearly follows from the first part. 

We say that a structure 21 is model-complete if and only if the first-order theory 
Th(2l) of 21 is model-complete. As we see below, for w-categorical structures 21 model- 
completeness of 21 can be translated into a property of the self-embedding monoid of 
21, and into a property concerning the axiomatization of Th(2l). In the following 
theorem, the equivalence of (1) and (4) can be found in [46| . The implication from 
(5) to (1) has been observed in [38] . 

Theorem 3.6.7. Let 58 be lo- categorical. Then the following are equivalent. 

(1) The structure 58 is model- complete. 

(2) 27i(58) is equivalent to a V3-theory. 

(3) 58 has a homogeneous expansion by existentially definable relations. 

(4) Every self-embedding o/58 is locally generated by the automorphisms of*B. 



(5) Every self-embedding of % is locally invertible (see Section 3.4-3). 



Proof. The implication from (1) to (2) holds for all structures 58 (we do not 
need cj-categoricity; see e.g. Theorem 7.3.3 in |114| ). The reverse direction is a direct 
consequence of a result known as Lindstrom's test (Theorem 7.3.4. in [114] ). 

We now prove (1) (3) (4) =>> ( 5) => (1). 

Suppose that (1) holds. By Theorem 3.6. 2| every first-order definable relation has 



an existential definition in 58. Hence, when we expand 58 by all existentially definable 
relations, every first-order formula has a quantifier-free definition. Then (3) follows 
from Lemma T3. 6. 11 

Now suppose that (3) holds. We claim that every self-embedding e of 58 is in the 
closure of the automorphisms of 58. The restriction e' of e to a finite subset S of the 
domain of 58 is an isomorphism between finite induced substructures of 58, and also 
an isomorphism between the expansion of 58 by all existentially definable relations. 
Homogeneity of this expansion implies that e' can be extended to an automorphism 
of 58, which proves the claim. 

The implication from (4) to (5) is trivial. Finally, the implication from (5) to (1) 
it is a direct consequence of Theorem 3.4. 12| □ 



Note that all finite structures are model-complete: self-embeddings of 58 are 
automorphisms, and hence they are elementary. Every relation that is first-order 
definable in a finite structure also has an existential definition. 
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Using the concept of model completeness, we can restate Theorem |3.2.10| clari- 
fying in which sense the structure 58 constructed in Theorem |3 . 2 . 1 0| is unique. In the 
following, t is a finite relational signature. 



Theorem 3.6.8 (a variant of Theorem 3.2.10). Let N be a finite set of finite 
connected r-structures. Then there is a model- complete r-structure 58 whose age is 
the class of all finite N -free structures. The structure 58 is unique up to isomorphism. 

PROOF. Theorem |3.2.10| states the existence of an w-categorical Af-free struc- 
ture which is universal for the class of all countable A/"-free structures which has a 
homogeneous expansion by primitive positive definable relations. By (3) =>■ (1) in 
Theorem |3.6.7[ the structure 58 is indeed model-complete. We have to show that 
every model-complete structure £ with the same age as 58 is isomorphic to 58. Let T 
be the first-order theory of 58, and S be the first-order theory of C Since 58 and €. 
have the same age, S and T imply the same existential sentences. By Theorem 3.6. 5| 



S and T are equivalent theories. By cj-categoricity of T, 58 and £ are isomorphic. □ 

3.6.3. Cores. We have already encountered the concept of a core of a finite 
structure in Section To recall, a core is a structure 58 such that all endomor- 
phisms of 58 are embeddings, and a structure 58 is a core of 21 if 58 is a core and 
homomorphically equivalent to 21. The concept of the core of a finite relational struc- 
ture plays an important role in the classification program for finite-domain CSPs. 
Three crucial properties of finite cores are: 



every finite structure 21 has a core 58 (Proposition 1.1.10 1; 
all core structures 58 of 21 are isomorphic (Proposition 1.1.10); 
orbits of fc-tuples in finite cores 58 are primitive positive definable (Proposi- 
tion 



1.2.91 



Also for every infinite structure 21 there is a core 58 such that CSP(2l) = CSP(58). 
This follows from Lemma |3. 5.8 and the following proposition. 



Proposition 3.6.9. 7/58 is an epc model for its universal negative theory, then 
58 is a core. 

Proof. Suppose 58 is epc for its universal negative theory T, and let h be an 
endomorphism of 58. By epc, for b\,...,bu in B, if 58 (= R(h(bi), . . . , h(bf.)) or 
58 |= (h(b x ) = h(b 2 )), then 58 |= R(bx,...,b k ) or 58 h Oi = h), respectively. It 
follows that h is an embedding. □ 

There are many equivalent definitions of when a finite structure is a core: for 
example, a finite structure is a core if and only if all endomorphisms are surjec- 
tive, or injective, or bijective, or all endomorphisms are automorphisms. For infinite 
structures (even when they are w-categorical) , these definitions are in general not 



equivalent, see 16 17 23 . As we will see in this section, our definition of cores 
is the most appropriate definition in many contexts, in particular in the context of 
constraint satisfaction for w-categorical templates. 

Example 3.6.10. The structure (Q; <) is easily seen to be a core: every endo- 
morphism of (Q; <) must be injective, and must be strong. In contrast, the random 
graph (V; E) is not a core. By the defining property of the random graph, (V; E) 
contains arbitrarily large finite cliques. By Lemma |3.1.5| it even has an infinite clique 
as a subgraph. Therefore, (V; E) has endomorphisms with the property that they 
map pairs of non-adjacent vertices to pairs of adjacent vertices, and thus is not a 
core. 
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Before we prove general results about existence and uniqueness of cores, we state 
important properties of cores and model-complete cores for w-categorical structures 
that follow in a straightforward way from previous fact^J 

Theorem 3.6.11. Let 58 be uj- categorical. Then 58 is a core if and only if every 
existential formula is equivalent to an existential positive formula ewer 58. Moreover, 
the following are equivalent. 

(1) 58 is a model- complete core; 

(2) Every first- order formula is equivalent to an existential positive one over%; 

(3) The orbits of n-tuples in 58 are primitive positive definable in 58; 

(4) The automorphisms locally generate the endomorphisms of^. 

Proof. The first statement is straightforward from Theorem |3.4.7| To prove the 
equivalence of (1), (2), (3), and (4), we show implications in cyclic order. 

For the implication from (1) to (2), assume (1) and let be a first-order formula. 
To show that <j) is equivalent in 58 to an existential positive formula, it suffices to 



show that every endomorphism of 58 preserves <p, by Theorem 3.4.7 Since 58 is a 



model-complete core, every endomorphism of 58 preserves all first-order formulas. 



For the implication from (2) to (3), let O be an orbit. By Theorem 3.3.8 O has 
a first-order definition. Assuming (2), O even has an existential positive definition. 
Note that every existential positive formula can be written as a disjunctionof primitive 
positive formulas, so let <fi be such a definition of O. We can also assume without loss 
of generality that none of the disjuncts in (j) implies another (otherwise, we simply 
omit it). Since O is a minimal first-order definable relation, (f> can only contain a 
single disjunct, and therefore is primitive positive. 

(3) implies (4). Assume (3), and let e be an endomorphism of 58. To show that 
e is locally generated by the automorphism of 58, let t be a finite tuple of elements 
of 58. We have to show that there is an automorphism a of 58 such that e(t) = a(t). 
The orbit of t is primitive positive definable, and hence preserved by e. So e(t) is in 
the same orbit as t, and we are done. 

(4) implies (1). Suppose (4), that is, suppose that all endomorphisms are gener- 
ated by the automorphisms of 58. Since the automorphisms preserve all first-order 



formulas in 58, the same is true for the endomorphisms of 58, by Proposition 3.4.8 □ 



The fact that in w-categorical model-complete cores the orbits of n-tuples are 
primitive positive definable is one of the three key facts for finite cores 58 that we 
have mentioned above. The other two facts, existence and uniqueness of model- 
complete cores for w-categorical structures, follow from more general theorems that 
apply not only to w-categorical structures, as we will see in Section [3.6.G| 

We finally note the following consequence of Theorem |3 . 6 . 1 1 1 for future use. 

Corollary 3.6.12. Every uj- categorical model- complete core 58 has a homoge- 
neous expansion by primitive positive definable relations. 

Proof. When 58 is an w-categorical model-complete core, then by Item (3) in 
Theorem |3. 6. 11[ all orbits of n-tuples are primitive positive definable. So it suffices to 
show that the expansion £ by all orbits of n-tuples in 58 is homogeneous. Let i be an 
isomorphism between two finite induced substructures of £, and let {s\, . . . ,s n } be 
the domain of i. Since i preserves the orbits of n-tuples of 58, (s±, . . . , s n ) lies in the 
same orbit as (tj, . . . , t n ). Hence, there exists an automorphism of 58 and therefore 
also of € that extends i. □ 



^Yet another chara cterizat ion of when an ij-categorical structure is a model-complete core can 
be found in Proposition 3.6.21 
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3.6.4. Core Theories. A theory T is called a core theory if every homomor- 
phism between models of T is an embedding. Note that a finite or w-categorical 
structure 03 is a core if and only if it has a core theory (for w-categorical 03, this is 



an easy consequence of the Lowenhcim-Skolem theorem - Theorem 2.3.3). 



Proposition 3.6.13. Let T be a first-order r -theory. Then T is a core theory if 
and only if every existential formula is equivalent to an existential positive formula. 

Proof. First assume that T is a core theory, and let </> be an existential formula. 
Then <f> is preserved by all embeddings between models of T. Since all homomorphism 
between models of T are embeddings, <f> is also preserved by all homomorphisms 
between models of T. Hence, Theorem |3. 4. 3| implies that (j> is equivalent modulo T to 
an existential positive formula. The converse implication is trivial. □ 

We would like to point out an interesting corollary for CSPs. Let 03 be a structure 
with finite relational signature r. The constraint entailment problem for 03 is the 
following computational problem. The input consists of a primitive positive r-formula 
4>, and a single atomic r-formula ip, both <p and ip with free variables x\, . . . ,x n . The 
question is whether <f> implies {entails) ip in 03, i.e., whether 

03 |= Vxi, . . . ,x n {<j> => ip) . 

Corollary 3.6.14. Let r be a finite relational signature, and let 03 be a r- 
structure whose first-order theory T is a core theory. Then there is a polynomial-time 
Turing reduction from the constraint entailment problem for 03 to CSP(03). 

Proof. Let <fi,ip be an input to the constraint entailment problem for 03. Since 
T is a core theory, -<ip is by Proposition |3.6.13| equivalent to an existential positive 
r-formula, and hence equivalent to a disjunction ipx V • • • V ip m of primitive positive 
formulas. Since the signature r is finite, we can consider the size of this disjunction 
is bounded by a constant, for all possible inputs. Then <f> entails ip if and only if for 
all 1 < i < m, we have that 3x%, . . . , Xk- {<f> A ipi) is false in 03. Like this we have 
reduced the entailment problem to solving a constant number of constraint satisfaction 
problems for the structure 03. □ 

If we combine the assumption that T is a core with the assumption that it is 
model-complete, we arrive at Theorem |3.6.15| Its proof closely follows the proof of 
Theorem 7.3.1 in 
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Theorem 3.6.15. Let T be a first-order theory over signature r. Then the fol- 
lowing are equivalent. 

(1) T is a model- complete core theory. 

(2) Every model of T is an existential positive complete model of T . 

(3) If 21, 03 are models of T and h is a homomorphism from 21 to 03 then there 
are an elementary extension £ of 21 and an embedding g of 03 into € such 
that gh is the identity on 21. 

(4) Every first- order formula is equivalent to an existential positive formula mod- 
ulo T. 

Proof. (1) implies (2) is immediate from the definition of epc models: if 21 and 
03 are models of T and h: A — >• B is a homomorphism from 21 to 03, then h must 
be an embedding since T is a core theory, and in fact must be elementary since T is 
model-complete. Hence, for every tuple a from A and any existential positive formula 
4> such that h{a) satisfies 4> we have that a also satisfies <fi. 

(2) implies (3). Assume (2). Let 21 and 03 be models of T, and let ft be a 
homomorphism from 21 to 03. Choose a to be a vector that enumerates the elements 
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of 2t. Since 21 is an epc model of T, h is an embedding. Hence, every existential 



sentence that holds in (58, h(a)) also holds in (21, a). Proposition 2.4.3 now directly 
implies (3). 

(3) implies (4). We first claim that if (3) holds, then every homomorphism be- 
tween models of T preserves all universal r-formulas. For if h is a homomorphism 
of 2t into 58, a a tuple from A and <j>(x) a universal r- formula such that 21 |= <fi(a), 
then taking £ and g as in (3) we have £ |= <j>(g(h(a))) and so 58 |= <j>(h(a)) since <fi 
is a universal formula. This proves the claim. It follows from Theorem |3.4.3| that all 
universal r-formulas are equivalent to existential positive r-formulas. 

To finally prove (4), let <j)(x) be any first-order r- formula, wlog. in prenex normal 
form. By a simple induction on the number of quantifier-blocks we can transform <fi 
to an existential formula, using the fact that the innermost quantifier block is cither 
existential or universal, and can therefore be transformed into an existential formula 



(see Theorem 7.3.1 in 114 ). Finally, existential r-formulas are preserved by homo- 
morphisms between models of T, since such homomorphisms must be embeddings. 
Hence, the entire formula is even equivalent to an existential positive formula by 
Theorem |3X3 

(4) implies (1). Existential positive formulas are preserved by homomorphisms 
between models of T. □ 

From this we obtain a positive version of a fact known as Lindstrom 's test (The- 
orem 7.3.4 in 114| ). 



Proposition 3.6.16. Let T be a A- categorical T-theory, for A > |r|, which has 
no finite models, and whose unique model of cardinality A is epc for T. Then T is a 
model- complete core theory. 

Proof. We prove that every model of T is an epc model of T and use Theo- 
rem 3.6.15 So let 21 and 58 be two models of T and let ft be a homomorphism from 
21 to 58. Let a be a tuple such that 58 |= cj)(h(a)) and suppose for contradiction that 
21 ^= 4>{a). Then we can put those two structures into a new 2-sorted structure (com- 
prising 21, 58, and the homomorphism h between them) with first-order theory T', and 
apply the Lowenheim-Skolem theorem (Theorem |2.3.3| here we use the assumption 
that A > |r|) to produce a countable model of T" (where both sorts have the same 
cardinality since T has no finite models). By applying Lowenheim-Skolem again, this 
time to T' augmented by sentences expressing a bijection between the two sorts over 
a signature expanded by a new function symbol, we obtain a two-sorted model of 
T' where each sort has cardinality A, inducing structures £ and £>, respectively. By 
assumption there exists an epc model of cardinality A, and by A-categoricity £ is an 
epc model of T. This contradicts the fact that we can express in T' that 21 is not an 
epc model of T. □ 

Proposition 3.6.17. LetT be a model- complete core theory. ThenT is equivalent 
to a V3 + -theory. 



PROOF. This is an immediate consequence of Theorem |3.4.6 (where the theory 



denoted by T in Theorem |3 .4. 6 is empty and $ from Theorem 3. 4. 6| equals the theory 



T from the statement here) because for any sequence (58i)i< K of models of T with 
homomorphisms : 58^ — > 58 j , the gij are elementary. By the Tarski-Vaught theorem 



(Theorem 3.5.31, we have that (lim l<K 58,) |= T. □ 



3.6.5. Core Companions. In this section we study when we can pass from a 
theory T to a model-complete core theory T' that has the same CSP. 
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Definition 3.6.18. Let T be a first-order t -theory. Then a t -theory U is called 
a core companion of T if 

• U is a model- complete core theory; 

• every model of U homomorphically maps to a model of T; 

• every model ofT homomorphically maps to a model of U . 

Recall from Proposition |2.4.5| that the last two items in Definition |3.6.18| are 
equivalent to requiring that T and U imply the same universal negative sentences. 

Proposition 3.6.19. Let T be a V3 + -theory with signature r. If T has a core 
companion U , then U is up to equivalence of theories unique, and is the theory of the 
class of all epc models of T . 

Proof. It suffices to show that the epc models of T are precisely the models of 
U. We first show that every model 93 of U is an epc model for T; that is, we have to 
show that 03 is a model of T, and that 03 is epc for T. 

Since U is a core companion of T, there is a homomorphism e from 03 to a model 
21 of T. The assumption that U is a core companion of T also implies that there 
exists a homomorphism / from 21 into a model £ of U. Then foe is a homomorphism 
between two models of U, and since U is a model-complete core theory it must be an 
elementary embedding. This shows in particular that e is an embedding. 

We claim that 03 is a model of the V3 + -theory T. Let cf> — Vy.ip be a sentence 
from T where ip is a disjunction of existential positive and negated atomic r-formulas, 
and let b be a tuple from 03. Since 21 is a model of T and therefore satisfies Vy.ip, in 
particular the tuple e(b) satisfies ip. If e(b) satisfies a negated atom in the disjunction 
ip then b also satisfies tp as e is an embedding. Otherwise, e(b) satisfies an existential 
positive formula in the disjunction ip, and f(e(b)) satisfies <fi in £ as well since / is a 
homomorphism. But this shows that b satisfies ip in 03 since / o e is elementary. Since 
this holds for all b, we have proven that 03 satisfies <j>. 

The verification that 03 is an epc model for T is similar, and as follows. Let g 
be a homomorphism from 03 into another model 21 of T, b a tuple from 03, and 4> 
an existential positive formula with 21 \= <fr(g(b)). We have to show that 03 \— cf>(b). 
Again, since U is a core companion of T there exists a homomorphism h from 21 
into a model € of U. Since U is a model-complete core theory, the mapping hog 
is elementary. Since h preserves existential positive formulas, £ |= <p{h{g(b))) . Since 
ho g is elementary, 03 |= 4>(b). 

Conversely, we show that every epc model 03 of T satisfies U. By Proposi- 
tion 3.6.17 U is equivalent to a V3 + -theory, and thus it suffices to show that 03 
satisfies all V3 + -consequences Wy.t/j(y) of U, where tp is a disjunctions of existential 
positive and negative atomic r-formulas. Let b be a tuple of elements of 03. We have 
to show that 21 |= ip(b). Since U is a core companion, there is a homomorphism h 
from 03 to a model 21 of U. Since 21 |= Vy.ip(y), at least one disjunct 6{hib)) of ip is 
true in 21. If 9 is a negative atomic formula, then 9(b) is also true in 03 since h is a 
homomorphism. Now suppose that 9 is an existential positive formula. Since U is a 
core companion of T, there is a homomorphism g from 21 to a model £ of T. Since g 
preserves 9 we have that £ |= 6(g(h(b))). Now 03 |= 9(b), since 03 is an epc model of 
T. In both cases we can conclude that 03 |= tp(b). □ 

Proposition 3.6.20. Let T be a V3 + -theory with signature r. Then T has a core 
companion if and only if the class of epc models of T is axiomatizable by a t -theory. 

Proof. If T has a core companion U, then Proposition |3. 6. 19| above implies that 
U axiomatizes the epc models of T. 
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For the converse, suppose that the class of epc models of T is the class of all 
models of a r-theory U . Then every model of U is in particular a model of T, and 
every model of T homomorphically maps to a model of U by Lemma |3.5.8| So we 
only have to verify that U is a model-complete core theory to show that U is the 
core companion of T. Every model 21 of U is an epc model of T, and in fact an epc 



model of U. It follows by the equivalence of (1) and (2) in Theorem 3.6.15 that U is 



a model-complete core theory. □ 
3.6.6. w-categorical model-complete cores. We have already seen in Theo- 



rem 



3.6.11 that whether an cj-categorical structure is a model-complete core can be 
characterized in many different ways. The results in Section |3.6.4| provide a further 
characterization of w-categorical model-complete cores in terms of their axiomatiza- 
tion, as we see in the following. 

Proposition 3.6.21. Let 25 be a countable uj- categorical structure. Then 25 is a 
model- complete core if and only if its theory is equivalent to a V3 + -theory. 



Proof. One direction is Proposition |3 . 6. 17 For the other direction, assume that 



the theory of 25 is equivalent to a V3 -theory T. Then T has no finite models, and 
hence T has a countably infinite model that is epc for T, by Lemma |3.5.8| Hence, 



by Proposition 3.6.16 T is a model-complete core theory (since T is w-categorical, 
it is easy to see that we can assume for this application that the signature of T is 
countable), and so 25 is a model-complete core. □ 

We now present an existential positive version of Simmons' theorem (Theo- 
rem 3.6.6), which answers the question which CSPs can be formulated with an oj- 
categorical template. For a satisfiable theory T, let be the equivalence relation 
defined on existential positive formulas with n free variables x±, . . . , x n (we could have 
equivalently used primitive positive formulas here) as follows. For two such formulas 
tpi and </>2, let (j>\ <f>2 if for all existential positive formulas ip with free variables 
X\, . . . , x n we have that {4>i, ip}UT is satisfiable if and only if {^ 2 , ^>}UT is satisfiable. 

Theorem 3.6.22. Let T be a theory with the joint homomorphism property (JHP; 



confer Proposition 2.4-6). Then the following are equivalent. 

(i) T has a core companion that is either oj- categorical or the theory of a finite 
structure. 

(ii) has finitely many equivalence classes for each n. 
(Hi) T has finitely many maximal existential positive n-types for each n. 
(iv) There is a finite or ui- categorical model- complete core 25 that satisfies an 
existential positive sentence <f> if and only if T U {(j)} is satisfiable. 

Proof. We show (i) (ii) ^> (Hi) ^> (iv) (i). 

(i) =4> (ii). Let U be the core companion of T. Since U and T entail the same 
universal negative sentences, we can deduce that for every existential positive formula 
ip the theory U U {iji} is satisfiable if and only if T U {i/j} is satisfiable; from which it 
follows that the indices of ^ and ^ coincide. 

For a proof by contraposition, assume that has infinite index for some n. Let 
ipi and 4>2 be two existential positive formulas from different equivalence classes of 
Hence, there is an existential positive formula V>3 such that exactly one of {ipi, ■03}Ui7 
and {-02, "1/^3} U U is satisfiable. This shows that ip% and ip2 are inequivalent modulo 
U. Therefore there are infinitely many first-order formulas with n variables that are 
inequivalent modulo U, and U can neither be w-categorical by Theorem |3. 1 .4| nor the 
theory of a finite structure. 

(ii) => (Hi). We show that every maximal ep-n-type p is determined completely 
by the equivalence classes of the existential positive formulas contained in p. Since 



3.6. QUANTIFIER-ELIMINATION, MODEL-COMPLETENESS, CORES 



75 



there are finitely many such classes, the result follows. Let p and q be maximal ep- 
ro-types such that for every (f>\ £ p there exists <p\ £ q such that (pi ~J (p[ and for 
every cp2 £ q, there exists <p' 2 £ p such that 02 ~„ 02- We aim to prove that p = q. 
If not then there exists, without loss of generality, ip £ p such that tp ^ q. Since q is 
maximal, TUgU {ip} is not satisfiable. By compactness, T U ip} is not satisfiable 
for some finite conjunction 8 of formulas from q. Now, £ q by maximality and 
there exists by assumption 8' £ p such that 8 ~J 0'. By definition of we deduce 
T U {0', ^} satisfiable iff T U {0, ip} satisfiable. Since the latter is not satisfiable, we 
deduce that neither is the former, which yields the contradiction that TUpU {ip} is 
not satisfiable. 

(Hi) =>■ (iv). Assume {Hi), and let S be the set of all existential positive sentences 
(p such that T U {<p} is satisfiable, together with the set of all universal negative 
consequences of T. By Proposition |2.4.6[ S has a model C, and by Theorem |2.3.3| we 
can assume that £ is either finite or countable. Lemma 3. 5. 8| gives a homomorphism 
from £ to a finite or countable epc r-model 03 of S. Note that also 03 satisfies 
exactly those existential positive sentences that are satisfiable together with T. We 
consider the signature r', which is the expansion of r by fi n relations of each arity 
n, corresponding to the maximal pp-n-types of T. Any model of T has a canonical 
(unique) expansion to a r'-structure, by the new relation symbols labeling tuples that 
attain their type. Consider this canonical r'-expansion 03' of 03. We will shortly 
prove that 03' is homogeneous. From this it will follow that 03' and 03 are finite, or lu- 
categorical by Lemma 3.2.9 (since variable identifications are primitive positive, there 
is only a finite number of inequivalent atomic formulas of each arity n), whereupon 
w-categoricity is inherited by its r-reduct 03. 

To prove that 03 is a model-complete core, we use Theorem 3.6. 11| and show that 
every first-order formula <p is equivalent to an existential positive formula over 03. 
Since 03 has a homogeneous expansion 03' by primitive positive definable relations, 
(p is equivalent to a Boolean combination of primitive positive formulas. Because we 
have added a relation symbol for each maximal pp-n-type of T, and there are finitely 
many of those for each n, the negation of a primitive positive formula is equivalent to 
a finite disjunction of maximal pp-n-types. This proves that <p is in 03 equivalent to 
an existential positive formula. 

It remains to be shown that 03' is homogeneous. An existential positive formula 
4>{x) is said to isolate a maximal ep-n-type p(x) of T, if p is the only maximal ep-n- 
type of T of which (p is a member. If there is only a finite number of maximal ep-n- 
types of T, then it follows that each has an isolating formula. Let /: (ai, . . . , a m ) H> 
{b\, . . . ,b m ) be a partial automorphism of 03' (in the signature r'). Let a' be an 
arbitrary element of B' . Consider the ep-n-types p(x\, . . . , x m ) of (ai, . . . , a m ) and 
q{xi, . . .,x m ,y) of ( ai, . . a m ,a') in 21. 

By Proposition |3.5.10[ each of these types is maximal, and is isolated by the ep- 
formulas 8 p {xi, . . . , x m ) and 8 q (xi, . . . , x m , y), respectively. Furthermore, the type of 
{b%, . . . , b m ) in 03 is p, because the partial automorphism of 21' respects the signature 
r'. But now, since By. 8 q (x\, . . . , x m , y) is in p (by maximality), we may deduce a b' 
such that 03' |= q (bi, . . . , b m ,b') and consequently 03' |= q{b%, . . . , b m , b'). It follows 
that /': (oi, . . . , a m , a') H> {bi, ■ . . , b m , b 1 ) is a partial automorphism of 03' (in the 
signature r'). A simple back-and-forth argument shows that we may extend to an 
automorphism of 03', and the result follows. 

For the implication (iv) (i), observe that a finite or cj-categorical structure 03 
is a model-complete core if and only if it has a model-complete core theory - this is an 
easy consequence of the Lowenheim-Skolem theorem (Theorem 2.3.31. So it suffices 
to show that the first-order theory of 03 and T have the same universal negative 
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consequences, by Proposition |2.4.5} A universal negative sentence </> is implied by T 
if and only if T U {-></>} is unsatisfiable, which is the case if and only if 58 does not 
satisfy -i</> (and hence satisfies 4>). □ 



Thcorcm |3.6.22 implies a necessary and sufficient condition when an CSP can be 



formulated with an w-categorical template. 

Corollary 3.6.23. Let % be a structure with a finite relational signature. Then 
the following are equivalent. 

(1) There is an uj- categorical template 58 such that CSP(58) = CSP(2l); 

(2) has finite index for all n; 

(3) There exists a structure 58 with CSP(58) = CSP(2l) which has for all n > 1 
finitely many primitive positive definable relations of arity n. 

Proof. The implications from (f) to (3) and from (3) to (2) are easy. The 



implication from (2) to (1) follows from (ii) =4> (iv) in Theorem |3. 6. 22 □ 



We present a simple new proof of the following result from 23 , which will be of 
central importance in later chapters. 



Theorem 3.6.24 (of [23] ) . Every uj- categorical structure 21 is homomorphically 
equivalent to an to -categorical model- complete core 58. All model- complete cores o/2l 
are isomorphic to 58. 

PROOF. Let T be the first-order theory of 21; clearl y T h as the JHP. Sin ce T is 
w-categorical, ~J has finite index for each n (Theorem 3.1.4), and Theorem 3.6.22 



implies that T has a core companion S which is either w-categorical or the theory of 



a finite structure. By Proposition 3.6. f 9 the core companion of Th(2l) is unique up 
to equivalence of first-order theories. Since Th(58) is w-categorical or the theory of a 
finite structure, it follows that 58 is unique up to isomorphism. □ 

Since the mo del- complete core 58 of 21 from the previous theorem is unique up 
to isomorphism, we call it the model-complete core of 21. The following gives an 
indication that the model-complete core of an cj-categorical structure 21 is typically 
'simpler' than 21. 

Proposition 3.6.25. Let 2t be an co-categorical structure, and let 58 its model- 
complete core. Then 

• for every n, the number of orbits of n-tuples in 58 is at most the number of 
orbits of n-tuples in 58; 

• if$iis homogeneous, then 58 is homogeneous as well. 

Proof. Let / be a homomorphism from 21 to 58, and g be a homomorphism from 
58 to 21. It suffices to show that when two n-tuples t\ , t% from 58 are mapped by g 
to tuples si, S2 in the same orbit in 21, then t\ and t% lie in the same orbit in 58. Let 
a be an automorphism of 2i that maps s\ to s 2 - Since 58 is an w-categorical model- 
complete core, there are primitive positive definitions <pi and <f>2 of the orbits of t\ and 
t2- Since g, a, and / preserve primitive positive formulas, the tuple t 3 := f{ag(t\)) 
satisfies 4>l- As a{g{t{)) = s 2 = 3(^2), the tuple t 3 can also be written as /(ff^)), 
and hence also satisfies 4>2- Thus, <pi and <p2 define the same orbit, and t\ and i 2 are 
in the same orbit. 

For the second part of the statement, suppose that h is an isomorphism between 
two finite substructures 21 and 21' of 58. Then g (21) induces in 21 a structure that is 
isomorphic to 2t, since otherwise the mapping x i-> f(g(x)) would not be an endomor- 
phism of 58. Similarly, g(2l') induces in 21 a structure that is isomorphic to 21' and 
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21, and by homogeneity of 21 there exists an automorphism a of 21 that maps g(2t) 
to <?(2l'). The mapping e : x >-> f(ag(x)) is an endomorphism of 05, and therefore 
an embedding. By Theorem |3.6.11| this mapping is locally generated by the auto- 
morphisms of 03, and in particular there exists an automorphism j3 of 03 such that 
f3(x) = e(x) = h{x) for all elements x of 21. This proves homogeneity of 03. □ 

When the template of a CSP is a model-complete core, then this can be exploited 
in the study of the CSP in many ways. For instance, we have the following consequence 



of Theorem 3.G.11 and Lemma 1.2.8 which is essentially from 23 . 



Corollary 3.6.26. Let 03 be an lj- categorical model- complete core with domain 
B , and let £ be the expansion of 03 by finitely many unary singleton relations, that is, 
relations of the form {c} for some elements c of 03. If £ has a finite signature reduct 
with a hard CSP, then so has 03. 

Proof. Let {ci}, . . . , {ck} be the relations of £ that have been added to 03, and 



let £' be the finite signature reduct of £ having a hard CSP. By Theorem 3.6.11 the 
orbit of (ci, . . . , Cfc) in 03 has a primitive positive definition <f> in 03. Let 03' be the 
reduct of 03 whose signature contains all relation symbols mentioned in <f), and the 
relation symbols of £' that are also relation symbols in 03; observe that the signature 



of 03' is finite. NP-hardness of CSP (03) follows from Lemma 1.2.8 □ 



CHAPTER 4 



Examples 




The running examples in the previous chapter were the linear order (Q; <) and 
the random graph (V;E). Structures definable in those structures provide further 
examples of w-categorical structures, and they will be studied in great detail in Chap- 
ter [6j[9j and [TO] In this chapter, we present other cj-categorical structures 21 that will 
not be treated at the same level of detail as it will be done for (Q; <) and for (V; E) 
in Chapter [9] and Chapter 10 For example, we treat homogeneous C-relations, dense 
semi-linear orders, and the atomless Boolean algebra. In each case, we give a brief 
discussion on what is known about CSPs with templates that can be defined in those 
w-categorical structures. Thereby, we revisit many problems from Section |1.5| We 
also discuss w-categorical structures that serve as templates for problems in connected 
monotone monadic SNP. 

The w-categorical structures presented in this chapter are chosen so that they 
illuminate the diversity of the class of all w-categorical structures, and so that many 
computational problems and classes of computational problems from the literature 
can be formulated as CSPs for those structures. 



4.1. Phylogeny Constraints and Homogeneous C-relations 

The rooted-triple satisfiability problem from Section [1.5. 2| can be formulated as 
CSP(23) for an w-categorical template 03 (an observation from [23] ). There are various 
different ways how to define such a structure *B; the most convenient for us is via 
amalgamation. 
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Let T be the class of all finite rooted binary trees X. The leaf structure € of 
a tree T € T with leaves L is the relational structure (L; |) where | is a ternary 
relation symbol, and ab\c holds in £ iff yca(a,b) lies below yca(b,c) in T (recall that 
yca(a,b) denotes the youngest common ancestor of a and i) in a rooted tree T (see 
Section [l.5.2 ). We also call T the underlying tree of £. Let C be the class of all leaf 



structures for trees from T ■ 

Proposition 4.1.1. The class C is an amalgamation class. 

Proof. Closure under isomorphisms is by definition, and the hereditary property 
is straightforward to verify. For the amalgamation property, let 03 1, 032 € C be such 
that 21 = 03 1 fl ®2 is an induced substructure of both 03 1 and 032. We want to show 
that there is an amalgam of 53 1 and 032 over 21 in C. We inductively assume that the 
statement has been shown for all triples (21, 03' x , 03 2 ) where B[ U B' 2 is a proper subset 
of Bi UB 2 . 

Let Ti be the rooted binary tree underlying 03 1, and % 2 the rooted binary tree 
underlying 032. Let 03} € C be the substructure of 53 1 induced by the vertices below 
the left child of %x, and 03^ £ C be the substructure of 53 1 induced by the vertices 
below the right child of Ti. The structures 03^ and 53 1 are defined analogously for 
532 instead of 53i. 





Figure 4.1. Illustration for the proof of Proposition 4.1.1 



First consider the case that there is a vertex u that lies in both 03 } and 03};, and a 



vertex v that lies in both 03}> and 03 ^ (see Figure 4.1 for an illustration). We claim that 
in this case no vertex w from 53| can lie inside 53i: for otherwise, w is either in 03}, 
in which case we have uw\v in 03i, or in 53^, in which case we have vw\u in 03i. But 
since u,v,w are in A, this is in contradiction to the fact that uv\w holds in 532- Let 
£' € C be the amalgam of 03i and 532 over 2t) which exists by inductive assumption, 
and let 1' £ T be its underlying tree. Now let X be the tree with root r and %' as a 
left subtree, and the underlying tree of 53| as a right subtree. It is straightforward to 
verify that the leaf structure of X is in C, and that it is an amalgam of 53i and 532 
over 21 (via the identity embeddings). 

Up to symmetry, the only remaining essentially different case we have to consider 
is that B\ U B\ and B\ U B 2 are disjoint. In this case it is similarly straightforward 
to first amalgamate 03} with 03^ and 53^ with 03| to obtain the amalgam of 53i and 
082; the details are left to the reader. □ 
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Let 23 denote the Frai'sse-limit of C. The structure 23 is homogeneous, so it is 
a fortiori model-complete. It is straightforward to verify that 23 is a core. Since 
the age of 23 is the class of all leaf structures for structures from T~, it is obvious 
that CSP(23) is the rooted triple consistency problem. The relation | in 23 is closely 
related to so-called C-relations, following the terminology of [2]. C-relations became 
an important concept in model theory, see e.g. [105| . They are given axiomatically; 
the presentation here follows [2 
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A ternary relation C is said to be a C-relation on a set L if for all a, b, c, d G L 
the following conditions hold: 

CI C(a;b,c) -> C(a;c,6); 
C2 C(a;b,c) ->C(&;a,c); 
C3 C(a; b, c) -> C(a; d, c) V C(d; b, c); 
C4 a^6^C(a;fe,fe). 
A C-relation is called dense if it satisfies 

C7 C(a; b, c) -> 3e (C(e; b, c) A C(a; 6, e)). 

The structure (L; C) is also called a C-set. A structure T is said to be relatively 
k-transitive if for every partial isomorphism / between induced substructures of F 
of size k there exists an automorphism of T that extends /. Note that a relatively 
3-transitive C-set is necessarily 2-transitive (i.e., has a 2-transitive automorphism 



group, as defined in Section 3.3.3). 



Theorem 4.1.2 (Theorem 14.7 in [2]). Let (L:C) be a relatively 3-transitive C- 
set. Then {L; C) is uj- categorical. 

In fact, there is, up to isomorphism, a unique relatively 3-transitive countable 
C-set which 

• is uniform with branching number 2, that is, if for all a, 6, c G L we have 
C(a; 6, c) V C(b; c, a) V C(c; a, 6), 

• is dense, and 

• satisfies ^C(a;a,a) for all a e L. 

(See the comments in [2] after the statement of Theorem 14.7; the condition that 
—>C(a;a,a) for all (equivalently, for some) a G L has been forgotten there, but is 
necessary to obtain uniqueness.) 

It is straightfoward to verify that the Frai'sse-limit 23 of the amalgamation class 



from Proposition 4.1.1 has the same automorphism group (equivalently, is first-order 



interdefinable; see Section 3.3) to the relatively 3-transitive countable C-set that is 
uniform with branching number 2, dense, and satisfies ^C(a;a,a) for all a G L. 

A similar approach is possible to define a homogeneous template for the quartet 



satisfiability problem from Section |1.5.2 Alternatively, an cj-categorical template 



(£>; Q) for the quartet satisfiability problem can be given via a first-order definition in 
the structure 23 = (B; |) constructed above for the rooted triple consistency problem. 

The following definition will be useful now, and later. Since yea is associative as 
a binary operation, it makes sense to write yca(vi, . . . ,vi) for 

yca(vi,. . . , yca(vi-t,vi) ...) . 

Definition 4.1.3. When uj., . . . ,u% and vi, . . . , vi are leaves in a rooted tree T, 
then we write U\ . . . Mfc|i>i . . . , Vi if u :— yca{u\, . . . , u^) and v := yca(vi, . . . , v{) are 
disjoint in T, i.e., neither u lies above v nor v lies above u in T. 

The first-order definition of Q(x, y, u, v) is 

(xy\uv) V (uv\x A v x\y) V {xy\u A yu\v) . 
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Indeed, when u,v,x,y € -B, and T is the tree underlying the substructure of (B; |) 
induced by {u, v, x, y}, then the given formula describes the situation that the shortest 
path from x to y in T does not intersect the shortest path from u to v in 1. Note 
that whether this is true is in fact independent from the position of the root of T. 
We leave the verification to the reader that CSP((B; Q)) indeed describes the quartet 
satisfiability problem studied in comptuational biololgy. Lemma |3.1.10| implies u- 
categoricity of (B;Q). Similarly as for the C-relation given above, an axiomatic 
treatment of (B;Q) has been given in [2]; there, the relation Q has been called 
a D-relation, and this became standard terminology in model theory. As we have 
mentioned above, the structure (B; Q) can also be defined as a Frai'sse-limit of finite 



/^-structures; Cameron 65 calls the structures in the age of (B; C) Boron trees. 



4.2. Branching-Time Constraints 

The branching-time satisfaction problem from Section |1.5.3| can be formulated 
as CSP(03) for an cj-categorical structure 03 = (D; <, || , 7^); this has already been 
observed in [44] . This time, 03 has a simple explicit description. The domain B 
consists of the set of all non-empty finite sequences of rational numbers. For a = 
(gi, gi, . . . , q n ), b — (<7i, q[, . . . , q' m ), n < m, we write a < b if one of the following 
conditions holds: 

• a is a proper initial subsequence of b, i.e., n < m and qi = q[ for 1 < i < n; 

• qt = q[ for 1 < i < n, and q n < q' n . 

We define a < b if a < b or a = 6, and || is binary relation that contains all pairs of 
elements that are incomparable with respect to < (in particular, we have a \\ a for 
all a). A proof that 03 = (B; <, ||, ^) is 1-transitive and w-categorical can be found 
in [2] (Section 5). 

The reduct (B; <) of this structure is a semi-linear order, i.e., for all x £ D, the set 
{y I y < x } is linearly ordered by <; such structures have been studied systematically 



in the context of infinite permutation groups; see 65 82 . The structure 03 is not 
homogeneous, and therefore cannot be described as the Frai'sse-limit of a class of 
finite structures. However, the expansion of 03 by the relation xy\z with the primitive 
positive definition 

3u ((u < x) A (u < y) A (u \\ 2)) 

is homogeneous, and it is indeed possible to describe the structure (D; <, ||, 7^) as the 
reduct of the Frai'sse-limit of an amalgamation class; the respective proof is similar to 
the proof of Proposition |4.1.1| It follows that 03 is model-complete. 

4.3. Set Constraints 

Here we discuss how the set constraint satisfaction problems discussed in Sec- 



tion 1.5.5 and many other set constraint satisfaction problems, can be formulated 
with w-categorical templates, following |32| . 

Here, a set constraint satisfaction problem is a CSP for a template with a first- 
order definition in the structure 6 with domain "P(N), the set of all subsets of natural 
numbers, and with signature {n, U, c, 0, 1}, where 

• n is a binary function symbol that denotes intersection, i.e., n e = fl; 

• U is a binary function symbol for union, i.e., U = U; 

• c is a unary function symbol for complementation, i.e., c s is the function 
that maps S C N to N \ S; 

• and 1 are constants (treated as 0-ary function symbols) denoting the empty 
set and the full set N, respectively. 
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A set constraint language is a relational structure with a set of relations with 
a quantifier- free first-order definition in 6; we always allow equality in first-order 
formulas. For example, the relation {(x,y, z) € "P(N) 3 \ xDy C z} has the quantifier- 
free first-order definition z n (x Hy) = x D y over &. 



Proposition 4.3.1 (follows from Proposition 5.8 in |153 ). Let 25 be a set con 



straint language with a finite signature. Then CSP (25) is in NP. 

The first-order theory of the structure & is certainly not w-categorical - it is 
easy to verify that there are infinitely many pairwise inequivalcnt first-order formu- 
las with one free variable. However, all set constraint satisfaction problems can be 
formulated with an w-categorical template. To see this, first note that the structure 
(■P(N); U, n, c, 0, 1) is a Boolean algebra, with 

• playing the role of false, and 1 playing the role of true; 

• c playing the role of -i; 

• n and U playing the role of A and V, respectively. 

To facilitate the notation, we write x instead of c(x), and x ^ y instead of ->(x = y). 
It is well-known that all countable atomies^] Boolean algebras are isomorphic 



(Corollary 5.16 in 130 ; also see Example 4 on page 100 in 113 ). Let 21 denote 
such a countable atomless Boolean algebra; the domain 21 is denoted by A. Since the 
axioms of Boolean algebras and the property of not having atoms can all be written 
as first-order sentences, it follows that 21 is w-categorical. We also remark that the 
structure 21 has quantifier elimination (see Exercise 17 on Page 391 in 113| ). The 



link between the set constraint satisfaction problems over 2 N mentioned in Section [L5 
and the atomless Boolean algebra is the following. 

Proposition 4.3.2. Let £ be a set constraint language. Then there exists an 
uj- categorical structure 25 such that 25 and £ have the same existential theory. In 
particular, when £ has finite signature, then 25 and £ have the same CSP. 

Proof. Let fa, fa, ■ ■ • be quantifier-free first-order formulas that define the re- 
lations i?f,i?f,... of £ over 6 = fP(N); U, D, c, 0, 1). Let Rf,Rf,... be the re- 
lations defined by fa , fa , . . . over the atomless Boolean algebra 21. The structure 



25 = (A; Rf , R%, . . . ) is w-categorical (see the comment after Lemma 3.1.10). To 
verify that 25 and £ have the same existential theory, let $ be a conjunction of 
atomic formulas over the signature {i?i, R2, ■ ■ • }■ Replace each atomic formula of the 
form Ri(x\, . . . ,Xk) in $ by the formula fa(%i, ■ ■ ■ ,Xk). The resulting formula is a 
quantifier-free first-order formula in the language of Boolean algebras, {U, f~l, c, 0, 1}. 
We claim that $ is satisfiable in © if and only if it is satisfiable in 21. This follows 
from Corollary 5.7 in 1 153 : $ is satisfiable in some infinite Boolean algebra if and 
only if $ is satisfiable in all infinite Boolean algebras. □ 



A large class of tractable set constraint languages has been described in 32 (the 
description is in terms of polymorphisms); the class given there contains all known 
tractable set constraint languages, and is maximal tractable in the sense that every 
strictly larger class of set constraint languages contains a finite subset with an NP- 
hard CSP. 

4.4. Spatial Reasoning 

The principal difference between why the network satisfaction problem for RCC5 



(introduced in Section 1.5.61 is not a set constraint satisfaction problem as introduced 



-^An atom in a Boolean algebra is an element i/O such that for all y with x n y = y and x ^ y 
we have y = 0. If a Boolean algebra does not contains atoms, it is called atomless. 
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in the previous section is that in RCC5 we exclude the empty set as a possible value 
for the variables. To formulate the CSP for the network satisfaction problem of 
RCC5 and its fragments with w-categorical templates, we again use structures with a 
first-order definition in the atomless Boolean algebra, but restrict those structures to 
non-empty elements with non-empty complement (this observation has already been 



made in 84 , Proposition 4.4). 



Formally, let 21 be the atomless Boolean algebra, and let DR, PO, PP, PPI, EQ be 
the binary relations with the following first-order definition in 21 (and their intuitive 
meaning in quotes). 



DR(x,y) iff (xny = 0)Ax^yAx,y<£ {0,1} 

l x and y are disjoint' 
PP(x,y) iff (x n y = x) A x ^ y A x,y £ {0, 1} 

l y properly contains x' 
PPl(x,y) iff (iny = i/)A^i/Aa:,j/^{0,l} 

l x properly contains y' 
EQ(x,y) iff x = yAx,y${0,l} 

'x equals y' 

PO(x, y) iff -DR(i, y) A ^PP(x, y) A -.PPI(x, y) A x ^ y A x, y £ {0, 1} 
l x and y properly overlap' 

ft is not hard to see that those five relations form a proper relation algebra 
over the base set A \ {0, 1} whose abstract relation algebra is isomorphic to RCC5. 
Moreover, when T) is the structure that contains all binary relations that are first- 
order definable in (A\{0, 1}; DR, PO, PP, PPI, EQ) (so that we can associate a binary 
relation from 2) to every element of RCC5 in the natural way), then CSP(£>) and the 
network satisfaction problem for RCC5 are essentially the same problem (in the sense 



of Section 1.3.2). The structure T) has a (1-dimensional) first-order interpretation in 



21, and hence is w-categorical by Lemma 3.1.10 



4.5. CSPs and Fragments of SNP 

Recall that a constraint satisfaction problem can be viewed as a class of finite 
structures with finite relational signature (as described in Section [Pj and Section 1.4), 



namely the class of all satisfiable instances of the CSP. In this section we study the 
question when this class can be described by a r-sentence <& from a fixed logic C in 
the sense that for all finite structures 21, we have that 21 \= $ if and only if 21 € C. If 
there is such a sentence then we say that CSP(*B) is in C. 

The first two logics considered here will be first-order logic, and monadic SNP. 
It turns out that CSPs that can be described by a sentence from those logics can 
be formulated with cj-categorical templates. Finally, we present a new and more 
expressive logic, called Amalgamation SNP. Again, every problem in amalgamation 
SNP describes a problem in NP that can be formulated as CSP(Q3) for an w-categorical 
template *B, and the universal-algebraic techniques presented in later sections can be 
applied to study the computational complexity of the problems in this logic. 



4.5.1. First-order Definable CSPs. We first consider the situation where 
CSP (03) is in FO, i.e., can be described by a first-order sentence (in the sense just 
described). The class of CSPs in FO is quite restricted. It is not hard to see that 
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when CSP (25) is in FO, then in particular it can be solved in deterministic logarith- 
mic space. We will see that when CSP (21) is in FO, then there exists an w-categorical 
structure 25 that has the same CSPQ We use the following famous result. 

Theorem 4.5.1 (Homomorphism Preservation in the Finite 173| ). Let r be a 



finite relational signature, and let $ be a first-order T-sentence. Then $ is equivalent 
to an existential positive first-order sentence on all finite r-structures if and only if 
the class of all finite r-models o/$ is closed under homomorphisms ( Definition \l .l.ffy . 

In the rest of this section, r is a finite relational signature, and 25 be a r-structure. 
Recall that CSP(25), viewed as a class of finite r-structures, is closed under inverse 
homomorphisms and disjoint unions. In particular, the class of all finite r-structures 
that do not homomorphically map to 25 is closed under homomorphisms, and by 
Theorem |4.5.1| describable by an existential positive r-sentence This leads us to 
the following. 

Theorem 4.5.2. 7/CSP(25) is in FO, then there exists an uj- categorical structure 
25' that has the same CSP. 

Proof. From the remarks that precede the statement of the theorem, Theo- 
rem 4.5.1 shows that there is an existential positive r-sentence ^ such that 21 ho- 
momorphically maps to 25 if and only if 21 satisfies The sentence -i\E f can be 
re-written as a universal negative sentence in conjunctive normal form; let $ be such 
a universal negative sentence of minimal size. We claim that the canonical database £ 
for each conjunct in $ is connected. To see this, suppose that £ has several connected 
components. If one of them does not homomorphically map to 25, then $ was not 
of minimal size, since the corresponding conjunct could have been replaced by the 
(smaller) conjunctive query for the component. If all components homomorphically 
map to 25, then so does £, a contradiction to the fact that £ is the canonical database 
of a conjunct of <I>. 



Therefore each conjunct in <£> is connected, and we can apply Theorem |3.2. 10 



to 



the finite set N of canonical databases for all the conjuncts in <3>. From Theorem 3.2.10 
we obtain an w-categorical r-structure which is universal for CSP(25), which is what 
we had to show. □ 

4.5.2. CSPs in Monadic SNP. Also every CSP in monadic SNP can be for- 
mulated with an w-categorical template. 

Theorem 4.5.3 (from |29| ). Let £ be a structure with a finite relational signa- 
ture. If CSP(£) can be described by a monadic SNP sentence $, then there is an 
uj -categorical 25 such that CSP(25) = CSP(£). 

Proof. By Corollary |1.4.14[ we can assume without loss of generality that $ is 
a connected and monotone monadic SNP sentence. 

We assume without loss of generality that $ is written in negation normal form. 
Let Pi, . . . , Pk be the existential monadic predicates in $. Let r' be the signature 
containing the input relations from r, the existential monadic relations Pi, and new 
symbols P[ for the negative occurrences of the existential relations. 

By monotonicity, all such literals with input relations are positive. For each exis- 
tential monadic relation P, we introduce an existentially quantified monadic relation 
symbol P[, and replace negative literals of the form -iPj(a;) in $ by P[{x). We shall 
denote the r'-formula obtained from $ after this transformation by We define M 
to be the set of r'-structures containing for each clause i\> in $' its canonical database 

2 An exact characterization of those o;-categorical structures that are in FO can be found in |33| , 



obtained by a slight modification of a proof for finite domain CSPs in 137 
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(as defined in Section 1.4.3). We shall use the fact that a r'-structure 21 satisfies a 
clause ip if and only if the the canonical database of ip is not homomorphic to 21. 
Since 3> is connected, all structures in Af are connected. 



Then Theorem 3.2.10 asserts the existence of a Af-hee w-categorical r'-structure 
93' that is universal for all A/"-free structures. We use 93' to define the template 93 with 
the properties required in the statement of the theorem we are about to prove. The 
structure 93 is the r-reduct of the restriction of 93' to the points with the property 
that for all existential monadic predicates Pi, 1 < i < k, either P, or P[ holds (but 



not both Pi and P[). It follows from Theorem 3.1.10 that reducts of w-categorical 
structures, and restrictions to first-order definable subsets of oj-categorical structures 
are again cj-categorical. Hence, the resulting r-structure 93 is w-categorical. 

We claim that a r-structure 21 satisfies <& if and only if 21 homomorphically maps 
to 93. First, let 21 be a structure that has a homomorphism h to 93. Let 21' be the 
r'-expansion of 21 such that for all i < k and a E A the relation Pi (a) holds in 21' if 
and only if P(/i(a)) holds in 93', and P/(a) holds in 21' if and only if P/(%)) holds 
in 93'. Clearly, h defines a homomorphism from 21' to 93'. In consequence, none of 
the structures from Af maps to 21'. Hence, the r-reduct 21 of 21' satisfies 

Conversely, let 21 be a r-structure satisfying $. Consequently, there exists a r'- 
expansion 21' of 21 that satisfies the first-order part of and where for every a G A 
exactly one of Pi (a) or P/(a) holds. Clearly, no structure in Af is homomorphic to 
21', and by universality of 93' the r'-structure 21' is an induced substructure of 93'. 
Since for every a £ A exactly one of Pi (a) and P/(a) holds, 21' is also an induced 
substructure of the restriction of 93' to B. Consequently, 21 is homomorphic to the 
r-reduct of this restriction. This completes the proof. □ 

4.5.3. Amalgamation SNP. In this section we introduce a logic that describes 
only CSPs with w-categorical templates, and which we call Amalgamation SNP, or 
short A SNP. 

Definition 4.5.4. Amalgamation SNP is the logic that consists of all monotone 
SNP sentences $ where the class of all finite models of the first-order part of $ has 
the amalgamation property. 

It can be verified that the examples of SNP sentences given for CSP((Z; <)) in 



Example 1.4.2 and for CSP((Z; Betw)) in Example 1.4.3 are in fact Amalgamation 



SNP sentences. Recall that a structure is called finitely bounded if its age can be 



described by finitely many forbidden induced substructures (Definition 3.2.7) 



Proposition 4.5.5. Every sentence in Amalgamation SNP describes a problem 
of the form CSP(93) for an u- categorical structure 93 that can be expanded to a ho- 
mogeneous finitely bounded structure. 

Proof. To prove the first statement, let $ be a sentence in ASNP, let r be the 
input signature of $ (i.e., the free relation symbols in $), and let </> be the first-order 
part of $ (which is a cr-formula, for u D t). Since the class of all finite models of <p> 
has the amalgamation property, Theorem |3.2.2| asserts the existence of a countable 
homogeneous cr-structure £ whose age is exactly the class of all finite cr-structures 
that satisfy (p. The structure € is finitely bounded; this is witnessed by the set 
of all cr-structures 9t with a minimal number of vertices that do not satisfy (p. None 
of those structures 9t can have larger size than the number of variables of <p, and 
hence Af is finite. To see that Af indeed bounds the age of £, let 21 be a finite 
induced substructure of £. Then 21 cannot have a substructure from Af, since any 
such substructure would falsify <p>, and hence also 21 would not satisfy (p. Now suppose 
that £ does not contain 21 as a finite induced substructure. This means that 21 falsifies 
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. Then there is a minimal number of elements witnessing that the universal formula 
is false in 21, and those elements induce a structure from JV, which proves the 
claim. By Lemma [3.2.9 the structure £ is w-categorical. The r-reduct 03 of £ is also 



cj-categorical (Lemma 3.1.101 



We finally show that every finite r-structure 21 satisfies $ if and only if it homo- 
morphically maps (in fact, embeds) into 03. When 21 is a r-structure that satisfies 
then there exists a er-expansion 21' of 21 that satisfies 4>. By universality of £ the 
same map is an embedding of 21' into £, and this gives an embedding of 21 into 03. 

Conversely, suppose that there is a homomorphism / from a r-structure 21 to 03. 
Then we construct the c-expansion 21' of 21 by putting for all S € a \ r the tuple 
(ii, . . . , t n ) into S" 21 if and only if (/(ii), . . . , f(t n )) € 5 C . Suppose for contradiction 
that there were a tuple £ = (ti , . . . , tjfe) of elements that violates a clause of </> in this 
expansion 21'. Then the image of / induces a substructure in £ that also violates <f>, 
since all relation symbols from r appear negatively in <f>. This is a contradiction, and 
hence 21' satisfies <j>. We conclude that 21 satisfies <&. □ 

We want to give an example of an w-categorical structure 03 such that CSP(03) 
is in NP, but not in Amalgamation SNP. To show that CSP(03) is not in ASNP, the 
sequence (m„)„>i of numbers of maximal pp-n-types of Th(03) turns out to be useful 
(see Section 3.6.6[ ) . Note that when two structures have the same CSP, then they have 



the same number of maximal pp-n-types. Hence, when $ is a sentence from ASNP, 
there is a unique sequence (m„) n >i such that m n equals the of maximal pp-n-types of 
Th(03) for any 03 such that <£> describes CSP(03). We call (m„)„>i the characteristic 
sequence of 

Proposition 4.5.6. Let <£> be a sentence from Amalgamation SNP. Then the 
characteristic sequence of $ is in 0(2 p (™)) ; for some polynomial P. 



Proof. Proposition |4.5.5| shows that there exists an w-categorical structure 03 



such that $ describes CSP(03). The proof of Proposition 4.5.5 shows that there exists 
an expansion £ of 03 by finitely many relation symbols which is homogeneous. Hence, 
an orbit of n-tuples in £ is uniquely described by the atomic formulas that hold on 
a (equivalently, all) tuples from this orbit, and since the signature of £ is finite, this 
gives a bound of 2 p (") on the number of orbits of rt-tuplcs in £, for some polynomial 
whose degree equals the maximal arity of the relations in £. The number of orbits of 
n-tuples is an upper bound on the number of maximal pp-n-tpyes (since two tuples 
with the same orbit clearly have the same pp-type). □ 

We can now present the example of an cj-categorical structure 03 such that 
CSP(03) is in NP, but not in ASNP. We use a CSP for a set constraint languages, as 
introduced in Section [1.5.51 

Example 4.5.7. Let 03 be the structure that contains all relations of arity at most 
three with a quantifier-free first-order definition in the atomless Boolean algebra 21. 



Since 21 is cj-categorical, the signature of 03 is finite. By Proposition 4.3.1 CSP(03) is 
in NP. To prove that CSP(03) is not in ASNP it suffices to show that the characteristic 
sequence of $ grows faster than 0{2 P ^), for any polynomial P. We first show that 
03 is a model-complete core. Trivially, 03 is a core, since with each relation also the 
complement of the relation is a relation of 03. To see that 03 is model-complete, let 
4> be a first-order formula that defines a first-order relation R over 03; we have to 
show that R also has an existential definition over 03. By quantifier-elimination of 21 
(recall that 21 has function symbols U, H, c, 0, 1), there is a quantifier-free first-order 
formula tp that defines R over 21. By un-nesting terms in if} with the help of new 
existentially quantified variables, and replacing occurrences of atomic formulas by 
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the corresponding formulas in the language of 23 (for instance replacing formulas of 
the form x D y — z by S(x, y, z) where S is the relation of 23 defined by x n y = z), we 
find the required existential definition of R in 23. By Theorem |3.6.11[ the orbits of 
n-tuples in 23 are primitive positive definable, and so m n equals the number of orbits 
of n-tuplcs of 23. 

We show that m n > 2 2L 1 J . To see this, let I = [n/2\ and X := {xi, . . . , x{\ 
be elements such that for any two distinct subsets S±, S2 of X the elements HSi and 
CiS2 of 21 are distinct. Hence, there are 2 l many elements a\, . . . ,a 2 ; that can by 
formed from xi,...,xi by applying PI, and there are 2 2 many ways of selecting a 
tuple (jjx, . . . ,yi) of elements from {01, . . . , a 2 i}. For any such selection, and since 
the relations {(x,y,z) \ x fl y = z} and {(x, y, z) \ x U y = z} are in 23, the tuple 
(xi, . . . , Xi, yi, . . . , yi) will lie in a distinct orbit, which shows the claim. 



From this example and the proof of Proposition |4.5.6| we can also see the following, 
which we note for later use. 

Corollary 4.5.8. There is no relational structure 23 that can define the atomless 
Boolean algebra and is homogeneous in a finite relational signature. 

4.5.4. CSPs in SNP without an w-categorical template. We have seen in 
Section |L4~3| that every CSP in SNP can also be described by a connected monotone 
SNP. We have also seen a characterization of those CSPs that can be formulated with 



an w-categorical template (Theorem 3.6.22). So it is natural to ask for a concrete 



example of a connected monotone SNP sentence that cannot be formulated with an 
w-categorical template. 

Example 4.5.9. Let $ be the following connected monotone SNP sentence. 

3E Vx, y, z [E(x, x) 

MrE[x,y) VE{y,x)) 
A(-^E{x,y)V^E(y,z)V E{x,z)) 
A(^succ(x 1 u) V ->E(x, y) V ~^succ(y, v) V E(u, v)) 
/\{-^succ(x, u) V -iE(u, v) V ~^succ{y, v) V E{x, y)) 
A(->E(x, y) V ->succ(x, y))j 
The sentence <I> describes CSP((Z; succ)) where succ — {(x, y) £ 1? \ y = x + 1} (as 



in Section 1.7.4.1 ). The idea is that an {E, swcc}-structure satisfies the quantifier- free 
part of if the binary relation E denotes a (symmetric and transitive) relation that 
holds between two vertices when all homomorphisms from the {succ}-reduct of the 
structure to (Z; succ) map those vertices to the same integer. The number of maximal 
pp-n-types is the same in any structure 23 where CSP(23) is described by $, so by The- 
orem 3.6.22 it suffices to check that (Z; succ) has an infinite number of maximal pp-2- 
types. But this is clear since the formula <f> n defined 

A"=l SUCc(xi, Xi-i) 

is for each n in a different pp-2-type. 



CHAPTER 5 



Universal Algebra 



AAAAAAAAA 
AAAAAAAAAA 
AAAAAAAAA 
AAAAAAAAA 
AAAAAAAAAA 
AAAAAAAAAA 




A A A A 
A A A A 
AAA 
AAA 
AAA 
A A A A 



A A 


A A 


A 


A 


AAA 


A A 


AAA 


A A 


A A 


A A 


A 


A 


AAA 


A A 


AAA 


A A 


A A 


A A 


A 


A 


AAA 


A A 


AAA 


A A 


A A 


A A 


A 


A. 


AAA 


A A 


AAA 


A A 


A A 


A A 


A 


A 


AAA 


A A 


AAA 


A A 


A A 


A A 


A 


A 


AAA 


A A 


AAA 


AA 


A A 


A A 


A 


A 


AAA 


A A 


AAA 


A A 


A A 


A A 


A 


A 


AAA 


A A 


A A A 


A A 


A A 


A A 


A 


A 


AAA 


A A 


AAA 


A A 


A A 


A A 


A 


A 


AAA 


A A 


AAA 


A A 


A A 


A A 


A 


A 


AAA 


A A 


A A A 


A A 


AA 


A A 


A 


A. 


AAA 


A A 


AAA 


AA 


A A 




A. 


A. 






AAA 


A A 


A A 


A A 


A 


A 


AAA 


A A 


AAA 


A A 


A A 


A A 




A 


AAA 


A A 


AAA 




AA 


AA 


A 


A 


AAA 


AA 


AAA 


A A 


AA 


AA 


A 


A 


AAA 




it A A 


AA 


AA 


AA 


A 


A 


AAA 




La a 


AA 


AA 


AA 


A 


A 


AAA 




f&A A 


AA 


AA 


AA 


A 


A 


AAA 




^\A 


AA 


A A 


A A 


A 


A 


AAA 


aL. 


HA A 


AA 


A A 


A A 


A 


A 


AAA 


All 


Taa 


A A 



One of the central concerns of universal algebra, similarly as in mo del- theory, is 
classification of mathematical structures. Often, model-theory is considered to be an 
extension of universal algebra, as formulated by Chang and Keisler in 

model-theory = univeral algebra + logic. 
Universal algebra leads to classification results with finer distinctions: while model 
theory often considers two relational structures to be equal when they are first-order 
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5. UNIVERSAL ALGEBRA 



interdefinable, universal algebra provides methods that allow to distinguish relational 
structures up to primitive positive definability. To do so, we study higher-dimensional 
generalizations of endomorphisms monoids, called polymorphism clones; from the 
perspective of this text, we therefore have 

model-theory = one-dimensional universal algebra. 

The strongest universal-algebraic classification results are available on finite do- 
In recent years, strong links between deep and central questions in 



mams 
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universal algebra and the Feder-Vardi conjecture have led to renewed activity. In 



fact, several important new and purely algebraic results, for example from 13 117 



152 180 , were originally motivated by questions about CSPs. 

There has been less work on algebras over infinite domains. However, a consid- 
erable amount of universal-algebraic techniques also applies when the algebra under 
consideration contains as operations all the permutations from an oligomorphic per- 
mutation group; we will call such algebras oligomorphic. The assumption that the 
algebra be oligomorphic seems to provide the necessary amount of 'finitcncss' that we 
need for applying universal-algebraic methods. 

We have decided to give a self-contained presentation of this section, even though 
that this implies a certain redundancy for the reader who has already followed Chap- 
ter [3] The step from algebras with only unary functions to algebras that contain 
higher-ary functions is the step where universal algebra becomes interesting. At the 
same time, the step from studying automorphisms and embeddings to studying poly- 
morphisms is the step that is new to model-theorists, so we found it natural to divide 
the material into a chapter on model theoretic and a chapter on universal algebraic 
background. 



This chapter contains original material from 24 25 27 28 44 



5.1. Oligomorphic Clones 

Fix a countably infinite base set B, also called the domain or base set. For all 
n > 1, denote by ff^ the set B B " = {/ | B n B] of 71-ary operations on B. Then 
iff := lj n>1 6^ is the set of all operations on B of finite arity. A clone ^ is a subset 
of & satisfying the following two properties: 

• ^ contains all projections, i.e., for all 1 < k < n the operation 7r£ g 
defined by k^(xi, . . . , x n ) = Xk, and 

• ^ is closed under composition, that is, for all / £ 0^ and <?i,... ,g n & 
%f C [m) the operation f(g u . . . , g n ) E 6 {m) defined by 

(x 1 , . . .,x m ) ^ f(gi(xx, . . .,x m ), . . .,g n (xi, . . -,x m )) 

is an element of ^ . 

For our applications of universal algebra in logic, we are interested in clones that 
satisfy an additional topological closure propertjQ A clone ^ is called locally closed^ 
if and only if it satisfies the following interpolation property: 

for all n > 1 and all g G ff {n \ if for all finite A C D n there exists 
an n-ary / 6 c € which agrees with g on A, then g £ c & . 



lr The corresponding topology is defined as follows. Equip B with the discrete topology, and 
ffi n ) = B B with the corresponding product (Tychonoff) topology, for every n > 1. (For background 
in topology, see Chapter [7]) A clone c tf is closed with respect to this topology iff each of its n-ary 
fragments c g n 0^ is a closed subset of 0^ . 

2 In universal algebra, locally closed clones are often just called local clones. Topologists would 
call such objects just closed clones since the reference to the specific topology under consideration 



is clear. Our choice to call those objects locally closed is a compromise, and standard 183 
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The following proposition is folklore in universal algebra, see e.g. 183 . Its proof 



is very similar to the proof of Proposition |3 . 3 . 2| and we leave it to the reader. 

Proposition 5.1.1. Let ^ C ff be a set of operations. Then the following are 
equivalent. 

(1) & is the polymorphism clone of a relational structure; 

(2) & is a locally closed clone. 

Arbitrary intersections of clones are clones, and arbitrary intersections of locally 
closed sets are locally closed. In fact, the set of all locally closed clones on B, partially 
ordered by inclusion, forms a complete lattice. The join of a family (^*)igj can be 
obtained as follows. First, take the set of all operations on B which can be obtained 
by composing operations from Uiez^' this set is a clone, but might not be locally 
closed. For this reason, we have to additionally form the topological closure of this 
set in order to arrive at the join in this lattice. For a set of operations & C 6, we 
write (J?) for the clone locally generated by JF, i.e., for the smallest locally closed 
clone containing 

Definition 5.1.2. Let 'tf C ff be a clone. A unary operation e £ < ta is called 
invertible in c & if there exists a unary i £ ^ such that i(e(x)) = x for all x £ B. 

Suppose that ^ is the polymorphism clone of a structure 03. Then obviously any 
invertible operation of is an automorphism of 03. 

Definition 5.1.3. A clone c £ C 6 is oligomorphic if the set of operations in ^ 
that are invertible in c € forms an oligomorphic permutation group. 

It is immediate from Theorem |3. 1 .4| that a locally closed clone is oligomorphic if 
and only if it is the polymorphism clone of an w-categorical structure. 

5.2. The Inv-Pol Galois Connection 



The exposition in this section parallels that of Section 3.3 Let / be from G^ n \ 
and let R C B m be a relation. Then we say that / preserves R (and that R is in- 
variant under f) iff f(ri, . . . , r„) £ p whenever r\, . . . ,r n £ R, where f{r\, . . . , r n ) is 
calculated componentwise. For a relational structure 03 (or for a set of relations TV) 
with domain B, we write Pol(03) (or Pol(7?.), respectively) for the set of those oper- 
ations in & which preserve all relations from 03 (all relations in TV). The operations 
in Pol(03) are called polymorphisms of 03. Note that the polymorphisms of 03 are 
exactly the homomorphism from finite powers of 03 to 03. 

We have seen how to assign sets of operations to sets of relations; likewise, we 
can go the other way. Given a set of operations & C 0, we write Inv(#") for the 
set of all relations which are invariant under all f £ & " . Using the Galois connection 
defined by the operators Pol and Inv, we obtain the following well-known description 
of the hull operator & ^ Pol(Inv(^")) (confer |183] ). 



Proposition 5.2.1. Let & C ff, and g £ . Then the following are equivalent. 

• g is in the local closure of the operations of the clone generated by jF; 

• For all n > 1 and all ai, . . . , au £ B n there is a f in the clone generated by 
J£" such that g(a\, . . . , a^) = /(ai, • ■ ■ , ctfe). 

• g £ Pol(Inv(Jf)); 

In particular, (j?) — Pol(Inv(jF)). 
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Proof. Note that the set of all operations that are in the local closure of 
the clone generated by is a clone and locally closed. Therefore, the clone (J?") is 
contained in and (1) implies (2). 

For the implication from (2) to (3), let g be a fc-ary operation that is in the local 
closure of the clone generated by & . Let a\, . . . , etfc be from B n for some n > 1. 
Suppose a, = {a\, . . . , a") for i < k, and let a J = (a{, . . . , a^.) for j < n. Since g is in 
the closure of J?', there exists an / G J^"' that agrees with g on {a 1 , . . . , a™} C £> fc . 
In particular, g(ai, . . . ,a fe ) = /(ai, . . . ,a k ). 

For the implication from (3) to (4), assume (3), and let R be from Inv(J£"). 
We have to show that g preserves R. Let ti,...,tk be from R. By assumption 
f(ti, . . . , tk) — g(t±, . . . ,£&) for some operation g generated from operations in & and 
projections. Since all those operations preserve R, we have that f(t\, . . . , t&) € R. 

To show that (3) implies (4), let / be a fc-ary operation from Pol(Inv(J?)). Let 
be the clone generated by & . It suffices to show that for every finite subset A of B 
there is an operation g € ^ such that /(a) = g(a) for all a £ A k . List all elements of 
A k by Oi, ... , a n , and consider the relation {(g(ai), . . . , g(a n )) \ g £ ff}. This relation 
is preserved by all operations in jF. Therefore, / preserves this relation, and so there 
exists g G H as required. □ 

The following is straightforward. 

Proposition 5.2.2. Let 03 be any structure. Then Inv(Pol(Q3)) contains (*B) PP , 
the set of all relations that are primitive positive definable in 03 . 

Proof. Suppose that R is A;-ary, has a primitive positive definition ip, and let 
/ be an Z-ary polymorphism of 03. To show that / preserves R, let be 
tuples from R. Then there must be witnesses for the existentially quantified variables 
xi+i, . . . , x n of ip that show that ip{ti) holds in 03, for all 1 < i < n. Write Si for the 
extension of U such that satisfies the quantifier-free part ip'(xi, . . . ,xi, Xi+i, . . . , x„) 
of ^ (we assume that ip is written in prenex normal form). Then the tuple 

(h(si[l], . . . , si[l)), . . . , /i(si[n], . . . , sj[n])) 

satisfies ^' as well. This shows that (h(si[l], . . . , s;[l]), . . . , h(si[k], . . . , s*[fc])) satisfies 
if) in 03, which is what we had to show. □ 

A relation R has a primitive positive definition in a finite structure if and only 
if R is preserved by all polymorphisms of this structure. This was discovered by 



Geiger 96 and independently by Bodnarcuk et al. 51 , and is of central importance 
in universal algebra. We have the following generalization of this theorem to to- 
categorical structures 43 . 



Theorem 5.2.3 (from |43j ). Let 03 be an lu- categorical or a finite structure. A 
relation R has a primitive positive definition in 03 if and only if R is preserved by all 
polymorphisms o/03; in symbols, Inv(Pol(03)) = (03) pp . 

Proof. One direction has been shown in Proposition |5.2.2"} For the other di- 
rection, let R be a fc-ary relation that is preserved by all polymorphisms of 03. In 
particular, R is preserved by all automorphisms of 03, and hence R is a union of orbits 
of fc-tuples in the automorphism group of 03. By item (2) of Theorem 3.1.4| there is 



a finite number of such orbits, Oi, . . . , O w . If R is empty, there is nothing to show 
(but we again use the assumption that _L is allowed as a primitive positive formula), 
so let us assume that w > 1. Fix for each 1 < j < w a /c-tuple aj from Oj. Let B 
be the domain of 03. Let b\, &2> • • • be an enumeration of all w-tuples in B w with the 
additional property that for 1 < i < k we have bi — (ffli[i], . . . , a w [i]) . 
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By Lemma |3.1.8[ if for every finite substructure 21 of 03 w that contains bi, . . . , bk 
there is a homomorphism from 21 to 03 that maps (&i, . . . , bk) to a tuple that is not in 
R, then there is also a homomorphism from 03 fc to 03 that maps (b\, . . . , &&) to a tuple 
that is not in 7?, and this would be a polymorphism of 03 violating R (the properties 
of a mapping to be a polymorphism and to violate R have universal axiomatizations) . 
So there must be a finite substructure 21 containing the vertices b\ , . . . , bk of B w such 
that all homomorphisms from 21 to 03 map b\ , . . . , bk to a tuple in R. 

Let q±, . . . , qi be the vertices of 21 without bi, . . . , bk- Write ip for the quantifier- 



free part of the canonical query of 21 (see Section 1.2.1). We claim that the for- 
mula 3qi , . . . , qi . ip is a primitive positive definition of R. The above argument shows 
that 3<7i, . . . , qi. tp implies R(b\, . . . , bk)- To show that every tuple in R satisfies 
3qi, . . . , qi. ip, let /: 21 — > 03 be a homomorphism such that the tuple (f(bi), . . . , f(bkj) 
is from R. Then (f(b\), . . . , f(bk)) £ Oj for some 1 < j < w. There is an automor- 
phism a of 03 sending a,j to (/(6i), . . . , f(bk))- So we can extend / to a homomor- 
phism from 03™ to 03 by setting f{x\, . . . , x n ) :— a(xj). This shows in particular that 
f(bi), . . .,f{b k )) satisfies 3q 1: ...,qi.i/). □ 

Analogously to Corollary |3.3.9[ we obtain a Galois connection between structures 
with a first-order definition in an w-categorical structure £, considered up to primitive 
positive interdefinability, and locally closed clones containing Aut(£). 

Theorem 5.2.4. Let £ be a countable u -categorical structure. Then we have the 
following. 

(1) The set of sets of the form (03) pp; where 03 is first-order definable in £, 
ordered by inclusion, forms a lattice. 

(2) The set of locally closed clones that contain Aut(£), ordered by inclusion, 
forms a lattice. 

(3) The operator Inv is an anti-isomorphism between those two lattices, and Pol 
is its inverse. 

The above theorem tells us that classifying the reducts of an w-categorical struc- 
ture £ up to primitive positive interdefinability really amounts to understanding the 
lattice of locally closed clones containing the automorphisms of £. 



5.3. Essential Arity 

This section investigates the bottom of the lattice of locally closed clones ^ that 
contain a fixed oligomorphic permutation group. 

5.3.1. Essentially unary operations. We say that / £ G 1 ^ depends on the 
argument i £ {1, . . . , k} if there is no (k— l)-ary operation /' such that f(x\, . . . , Xk) = 
f'(xi, . . . , Xi-i,Xi+i, . . . , Xk) for all x\, . . . , Xk € B. We can equivalently character- 
ize /c-ary operations that depend on the i-th argument by requiring that there are 
Xi, . . . , Xk S B and x[ G B such that 

f(xi,...,x k ) ^ f(x 1 ,...,x l - 1 ,x / i ,x l+1 ,...,x k ) ■ 

We say that an operation / is essentially unary iff there is an i € {1, . . . , k} and 
a unary operation /o such that f(x±,...,Xk) = fo( x i)- Operations that are not 
essentially unary are called essentiaZp] 



''This is standard in clone theory, and it makes sense also for us, since the essential operations 
are those that are essential for complexity classification. 
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Definition 5.3.1. For any set B, the relations Pg and Pf over B are defined 
as follows. 

P£ = {((a,b, c) G B 3 | a = b or b = c} 
Pf = {((a,b,c,d) E B i \a = b or c = d} 

Lemma 5.3.2. Let f e be an operation. Then the following are equivalent. 

(1) f is essentially unary. 

(2) / preserves P% . 

(3) / preserves Pf . 

(4) / depends on at most one argument. 

Proof. Let k be the arity of /. The implication from (1) to (2) is obvious, since 
unary operations clearly preserve P$ ■ 

To show the implication from (2) to (3), we show the contrapositive, and assume 
that / violates Pf ■ By permuting arguments of /, we can assume that there are an 
I < k and 4-tuples a 1 , . . . , a k € Pf with f(a 1 , . . . , a k ) ^ P 4 B such that in a 1 , . . . , a 1 
the first two coordinates are equal, and in a l+1 , . . . ,a fc the last two coordinates are 
equal. Let c be the tuple (a{, . . . , a[, a± 1 , . . . , a\). Since /(a 1 , . . . , a k ) Pf we 
have f(a{, . . . , of) ^ f{a\, /|), and therefore /(c) ^ f(a{, . . . ,a\ ) or /(c) ^ 
/( a 2r--i4)' Let d = [a\, . . . ,a\) in the first case, and d — {a\, ... ,a k ) in the 
second case. Likewise, we have /(c) ^ f{a\, ■ ■ ■ , a k ) or /(c) ^ /(a|, • ■ • , a k ), and let 
e = (aj, . . . , af) in the first, and e = (a|, . . . , a\) in the second case. Then for each 
i < k, the tuple (d^c^e*) is from P 3 B , but (/(d), /(c), /(e)) ^ P 3 S . 

The proof of the implication from (3) to (4) is again by contraposition. Sup- 
pose / depends on the i-th and j-th argument, 1 < i ^ j < k. Hence there 
exist tuples 01,61,02,62 € D k such that 01,61 and 02,62 only differ at the en- 
tries i and j, respectively, and such that /(ai) 7^ /(61) and /(a2) ^ /(^)- Then 
(oi(Z),6 1 (0,o 2 (/),6 2 (0) € Pf for all J < fc, but (f(a 1 )J(b 1 )J(a 2 ),f(b 2 )) £ Pf , 
which shows that / violates P 3 . 

For the implication from (4) to (1), suppose that / depends only on the first ar- 
gument. Let i < k be maximal such that there is an operation g with f(x%, . . . , Xk) = 
g{x\, . . . , Xi). If i = 1 then / is essentially unary and we are done. Otherwise, 
observe that since / does not depend on the i-th argument, neither does g, and 
so there is an (i — l)-ary operation g' such that for all X\, . . . , x n € D we have 
f(x±, . . . , x n ) = g{x\, . . . , Xi) = g'(x\, . . . , X4-1), contradicting the choice of i. □ 

Combined with Theorem |3.4.7| we obtain for w-categorical structures a character- 
ization of the situation where disjunction can be eliminated from existential positive 
formulas. 

Proposition 5.3.3. Let *B 6e an co-categorical structure, and let be its poly- 
morphism clone. Then the following are equivalent. 

(1) All relations with an existential positive definition in 03 also have a primitive 
positive definition in 93. 

(2) The relation Pjf is contained in Inv( < ^'). 

(3) All operations in c 1o are essentially unary. 



Proof. (1) implies (2). The formula (x = y) V (y = z) is existential positive, 
and thus has a primitive positive definition in 23; such formulas are preserved by ^ . 
(2) implies (3). Follows from Lemma 5.3.2 
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(3) implies (1). Unary operations preserve all existentially positive formulas. 
Hence, when <p is an existential positive formula, then by assumption all polymor- 
phisms of 05 preserve </>, and <f> is equivalent to a primitive positive formula by Theo- 
rem E321 □ 

If all operations of a clone ^ are essentially unary, we say that ^ is essentially 
unary. 

5.3.2. Elementary Clones. If every polymorphism of an w-categorical struc- 
ture 05 is locally generated by the automorphisms of 05, then 05 has the remarkable 
property that every first-order formula is equivalent to a primitive positive formula 
over 05. In this case, the polymorphism clone of 05 is the smallest element of the 



lattice of locally closed clones described in Section 5.2 The facts in this section are 



straightforward combinations of previous results. We state them for future use. 

Corollary 5.3.4. Let 05 be an uj- categorical structure, and let ^ be its polymor- 
phism clone. Then the following are equivalent. 

(1) Every relation with a first-order definition also has a primitive positive def- 
inition in 05. 

(2) 05 is a model- complete core, and P% is primitive positive definable in 05. 

(3) ^ is locally generated by the invertible operations in '. 

(4) All operations in are elementary, i.e., preserve all first-order definable 
relations in 05. 

Proof. (1) implies (2). We assume that every first-order definable relation has 
a primitive positive definition, and hence is preserved by all polymorphisms of 05. In 
particular, the endomorphisms of 05 preserve all first-order definable relations, and 
hence 05 is a model-complete core. Moreover, the relation is clearly first-order 
definable, and therefore also primitive positive definable in 05. 



(2) implies (3). Assume (2). Then Proposition 5.3.3 implies that all polymor- 
phisms of 05 are essentially unary. Thus, for every n-ary polymorphism / of 05 there 
is an endomorphism g of 05 and an j < n such that f(x\, . . . , x n ) = g{xj). Since 
05 is a model-complete core, and by Theorem |3.6.11| g is locally generated by the 
automorphisms of 05, and in particular by the invertible operations in '. Hence, / 
is locally generated by the locally invertible operations in which proves (3). 

(3) implies (4). Invertible operations of ^ preserve all first-order definable rela- 
tions in 05. Hence, the implication follows from Proposition |5 . 2 . l] 



(4) implies (1). By Theorem 5.2.3 □ 



We will see in Corollary 5.5.11 that when the equivalent conditions from Corol- 



lary 5.3.4 apply, then 05 has a finite signature reduct 05' whose CSP is NP-hard. 



5.3.3. Arity Reduction. For many combinatorial arguments with oligomor- 
phic clones it is crucial to have bounds on the arity of operations with certain proper- 
ties. A basic, yet extremely useful observation to obtain such bounds is the following 
(which holds for arbitrary structures 05). 

Lemma 5.3.5. Let 05 be a relational structure and let R be a k-ary relation that 
intersects m orbits of k-tuples of Aut(05). 1/05 has a polymorphism f that violates 
R, then 05 also has an at most m-ary polymorphism that violates R. 

Proof. Let /' be an polymorphism of 05 of smallest arity I that violates R. 
Then there are fc-tuples t\, . . . , i; £ R such that f'(ti, ■ ■ ■ ,£;) ^ R. For I > m there 
are two tuples £j and tj that lie in the same orbit of /c-tuples, and therefore 05 has 
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an automorphism a such that ottj — t^. By permuting the arguments of /', we can 
assume that i = 1 and j — 2. Then the (/ — l)-ary operation g defined as 

g(x 2 , ...,xi) := f'(ax 2 , x 2 ,..., x t ) 
is also a polymorphism of 03, and also violates R, a contradiction. Hence, I < m. □ 



We present applications of Lemma 5.3.5 Recall that r(Sf ) denotes the rank of 



i.e., the number of orbitals of (see Section 3.3) 



Corollary 5.3.6. Let 03 be a structure with an automorphism group Sf . If 03 
has an essential polymorphism, then it must also have an essential polymorphism of 
arity at most 2r(Sf) — 1. 

Proof. The structure 03 has an essential polymorphism if and only if it has 
a polymorphism that violates the relation , where B is the domain of 03, by 
Proposition 5.3.3 The relation P^ consists of at most 2r(Sf) — 1 orbits of triples: 



there are at most r(Sf ) orbits of triples (ti,t2, tz) where t\ = t 2 ^ £3, and at most that 
many where t\ ^ t 2 = t^. Only the orbit of the tuple where t\ — t 2 — t^ is counted 
twice. The statement follows from Lemma [5.3.51 □ 

Corollary 5.3.7. Let 03 be first-order definable in (Q; <), and suppose there is 
a polymorphism of 03 that violates < . Then there is also an endomorphism of 03 that 
violates <. 

Proof. Observe that < consists of a single orbit of pairs in Aut((Q; <)), and 
therefore also in Aut(03). □ 

Corollary 5.3.8. Let & C ff be a local clone that contains a 2-transitive per- 
mutation group <S . If there is an f G & that violates then & contains a constant 
operation. 

Proof. The relation ^ consists of a single orbit of pairs in Sf . Hence, there 
is a unary operation in & that violates ^ by Lemma 5.3.5 The rest follows by 



Lemma EEU □ 

Another application of Lemma |5.3.5| can be found in Section |5.4[ and many ap- 
plications can be found in Chapters |9| and [T0| 

5.3.4. Kara's method. We present another method for showing that an oligo- 
morphic clone with essential operations must contain a binary essential operation. 
The method applies in many cases where the arity bounds from Corollary |5.3.6 



too weak. The idea is taken from 37 , where it has been stated only for structures 



that are preserved by all permutations of the domain, and it has been generalized 



slightly in 46 . To state the method in full generality, we introduce the following, 



apparently new, concept. 

Definition 5.3.9. A permutation group 'S on a set B has the orbital extension 
property if there are orbitals 0\,Oi such that for all 61,62 € B there is an element 
c G B where (c, bi) G 0\ and (c, 6 2 ) G 2 - 

Note that every permutation group with the orbital extension property is transi- 
tive, and that every 2-transitive infinite permutation group has the orbital extension 
property. More examples of oligomorphic permutation groups with the orbital exten- 
sion property are the automorphism group of the Random graph, (Q; <), the univer- 
sal homogeneous poset, the universal homogeneous C-relation, and many more. An 
example of a structure without the orbital extension property is K u>u , the complete 
bipartite graph where both parts are countably infinite. An example of an imprimitive 
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oligomorphic permutation group with the orbital extension property is the automor- 
phism group of an equivalence relation on an infinite set with infinitely many infinite 
classes. 

Lemma 5.3.10. Let ^ be a clone that contains a permutation group Sf with the 
orbital extension property. Then, if contains an essential operation, it must also 
contain a binary essential operation. 

Proof. Let / be an essential operation in and let k be the arity of /. Assume 
without loss of generality that / depends all its arguments and is at least ternary. 
In particular, there are a±, . . . , a k and a[ such that /(oi, . . . , a k ) ^ a2, ■ ■ • , a k )- 
Let 0\ and O2 be the orbitals which exist due to the orbital extension property of Sf . 
We distinguish two cases. 

Case 1. There are 61,..., b k such that (6,,aj) € 0\ for 2 < i < k and 
a 2 , . . . , a k ) 7^ /(61, • ■ • , b k ). Then there are 03, . . . , a k € ^ such that ai(a 2 ) = dj 
and 0:1(62) = hi. We define 

g(x, y) ■= f(x, y, a 3 (y), ... , c*k(y)) , 

which clearly depends on both arguments. 

Case 2. For all 61, . . . , 6fe, if (aj, 6j) € 0\ for 2 < i < k, then f(bx,a 2 , ■ ■ ■ , dk) = 
f(bi, 62, ... , 6fe). Since / depends on its second coordinate, there are ci, . . . , Ck and 
c' 2 such that /(ci, c 2 , c 3 , . . . , c k ) 7^ /(ci, c' 2 , c 3 , . . . , c k ). Then /(ci, a 2 , . . . , a k ) can be 
equal to either /(ci, 0%, C3, . . . , c&), or to /(ci, c 2 , C3, . . . , c^), but not to both. We 
assume without loss of generality that f{c\, 02, . . . , <2fc) 7^ /(ci, c 2 , C3, . . . , c k ). Since 
Sf has the orbital extension property, we can choose 0*2,. ..,dk such that for each 
2 < i < k, the pairs (di,ai) all lie in 0\ and the pairs (di,Cj) all lie in 02- Hence, 
there are 03, . . . ,a k € ^ such that Oj(c 2 ) = q and a^(c?2) = rfj. We claim that the 
operation g defined by 

g(x, y) ■= f{x, y, a 3 (y), a k (y)) 
depends on both arguments. Indeed, we know that g(a\, d 2 ) = 0I2, ■ ■ ■ , d k ) = 

/(01, . . . ,fflfc), and that f(a' 1 ,d 2 ) = f(a[,d 2 , ...,d k ) = f(a' v a 2 , ■ ■ -,a k ). By the choice 
of the values ai,...,a k and these two values are distinct, and we have that g 
depends on the first argument. For the second argument, note that g(c\,d 2 ) = 
f(ci,d 2 ,...,dk) = f(ci,a 2 ,...,a k ) and that g(ci,c 2 ) = f(ci,c 2 ,...,c k ). Because 
/(ci, 02, ... , a k ) and /(ci, c 2 , . . . , c k ) are distinct, we have that g also depends on the 
second argument. □ 

Corollary 5.3.11. Let 25 be 2-transitive with an essential polymorphism. Then 
*B also has a binary essential polymorphism. 

5.3.5. Minimal Operations. In some classification results, e.g. in Chapter[9 
and Chapter |10| it turns out that a bottom-up approach works best: e.g, in Chapter [10 
we first classify all the minimal (with respect to set inclusion) locally closed clones 
that strictly contain the automorphism group of the random graph Aut((V; E)), and 
then the classification argument is organized according to those minimal clones. 

Definition 5.3.12. Let C 6 be a locally closed clone. We say that 

• a local clone S> Q ff is minimal above if 'i§ C and %f C $ C $1 implies 
§ = @ for all locally closed clones S . 

• a function f £ 6 is minimal above if f is not from , and of minimal 
arity such that for every g £ ^ that is locally generated by ffU {/} we have 
that U {<?} locally generates f. 

The following is obvious from the definitions. 
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Proposition 5.3.13. Let c <f be a locally closed clone. Then every minimal oper- 
ation above locally generates a clone that is minimal above % ', and every clone that 
is minimal above 'rf is locally generated by a minimal operation above e ^ ? . 

For oligomorphic clones minimality translates into maximality of Inv(^), and 
we obtain the following. 

Proposition 5.3.14. Let 58 be an to- categorical structure, and let £ be primitive 
positive definable in 58. Then Pol(£) is minimal above Pol(58) if and only if for every 
R € \ (£)pp the structure 58 has a primitive positive definition in (£, R). 



Proof. The equivalence follows from Proposition 5.2.4 □ 



It is well-known that every clone over a finite domain contains a clone S> which 



is minimal above the trivial clone that just contains the projections 76 . The same 
is not true for infinite domains: the clone with domain N that is locally generated by 
the operation x n- x + 1 does not contain a clone that is minimal above the set of all 
projections over N. The situation is again better when e tf is oligomorphic. 

Theorem 5.3.15 (from [25] ) . Let 58 be a finite or uj -categorical structure with a 
finite relational signature, and let 38 be its polymorphism clone. Then any clone c & 
that contains 38 contains a locally closed clone Si that is minimal above 38. 



Proof. By Proposition |5.3.l4] it suffices to show that there is a structure T> 
whose relations are a subset of lnv(38) such that for every R g lnv(38) \ Inv( < ^ p ) there 
is a primitive positive definition of 58 in (2),i?). 

Consider the partially ordered set of all locally closed clones that contain 38 and 
that are contained in ^ ', ordered by inclusion. From this poset we remove 38, which is 
the unique minimal element; the resulting poset will be denoted by P. We claim that 
in P, all descending chains Si D S2 2 • • • & re bounded, i.e., for all such chains there 
exists an S € P such that Sj 3 S for all i > 1. To see this, observe that Uj>i Inv(Si) 
is closed under primitive positive definability in the sense that it can be written as 
(6) pp for some relational structure & (since only a finite number of relations can be 
mentioned in a first-order formula, and since each of the Si is closed under primitive 
positive definability). 

Moreover, there must be a relation R 6 58 that is not contained in Ui>i Inv ( -S^ ) ; 
otherwise, since 58 has finitely many relations, there is a j < uj such that Inv(Sj) 
contains all relations from 58, and hence equals hw{3§), which is impossible since 58 
is not an element of P. Hence, by Corollary |3.3.9[ the structure © has a polymorphism 
/ that is not from 3$. Then 38 U {/} is contained in Hi>i &i an d is a lower bound 
of the descending chain (5,),>o. We can thus apply Zorn's lemma and conclude that 
P contains a minimal element S. Any structure S) whose relations are exactly the 
relations from S satisfies the initial requirements. □ 

For essentially unary oligomorpic clones 3$, we can bound the arity of minimal 
functions above 3S\ this follows from the following. 



Proposition 5.3.16 (from |49] ) . Let 58 be any relational structure with a finite 
number p of orbitals. Then every minimal clone above End(58) is generated by a 
function of arity at most 2 • p — 1 together with End(58). 

Proof. Let & be a minimal clone above End(58). If all the functions in ^ are 
essentially unary, then ^ is generated by a unary operation together with End(58) 
and we are done. Otherwise, let / be an essential operation in c £ . By Lemma [5. 3. 2| 
the operation / violates P® over the domain B of 58; recall that P% is defined by 
the formula (x = y) V (y = z). The subset of P® that contains all tuples of the form 
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(a, a, b), for a,b £ B, clearly consists of p orbits in 03. Similarly, the subset of P% that 
contains all tuples of the form (a, b, c), for a,b £ B, consists of the same number of 
orbits. The intersection of these two relations consists of exactly one orbit (namely, 
the triples with three equal entries) , and therefore Pf is the union of 2 • p — 1 different 
orbits. The assertion now follows from Lemma [5.3. 51 □ 



In Section |8.3| we will see that under further Ramsey-theoretic assumptions on 
the structure 03, there are only finitely many minimal closed clones above the poly- 
morphism clone of 05. 

5.4. Schaefer's Theorem 

Schaefer's theorem states that every CSP for a 2-element template is either in P 



or NP-hard. Via the Inv-Pol Galois connection (Section 5.2 1, most of the classification 
arguments in Schaefer's article follow from earlier work of Post 169 , who classified 
all clones on a two-element domain. We present a short proof of Schaefer's theorem 
here, using the results and ideas from Section [§~3| 

The following operations are important in this context. A ternary function 
/: B 3 — > B is called a minority function if it satisfies f{x,y,y) — f(y,x,y) = 
f(y,y,x) = y for all x,y £ B. It is called a majority function if f(x,x,y) = 
f(x,y,x) = f(y,x,x) — x for all x, y £ B. Note that on Boolean domains, the 
given identities determine / uniquely. A /c-ary operation / is called idempotent iff 
f(x, ...,x)=x. Also recall the definition of NAE and 1IN3. 

NAE ={(0,0, 1), (0, 1,0), (1,0,0), (1, 1,0), (1,0, 1), (1, 1,0)} 

1IN3 ={(0,0,1), (0,1,0), (1,0,0)} 

Lemma 5.4.1. Let f be an idempotent function on the domain {0, 1} that violates 
1IN3. 

• If f is binary, then f must be (x,y) t— > min(x,y) or (x,y) max(x,y). 

• If f is ternary, then f generates min, max, the majority or the minority 
operation. 

Proof. There is nothing to show for binary /. If / is ternary, then the operations 
defined by f(x,x,y), f(x,y,x), and f{y,x,x) must be projections, or otherwise they 
generate min or max and we are done. So we consider the following eight cases. 



Cases 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


f(x,x,y) 


X 


X 


X 


X 


y 


y 


y 


y 


f(x,y,x) 


X 


X 


V 


y 


X 


X 


y 


y 


f(y,x,x) 


X 


y 


X 


y 


X 


y 


X 


y 



In case (1) the operation / is a majority, and in case (8) a minority The cases (2), (3), 
and (5) are impossible since in this case / would preserve 1IN3. In the remaining three 
cases, which are symmetric with respect to permuting arguments of /, the function 
defined by f(x, f(x, y, z), z) is a majority. □ 



The following is well-known; the elegant proof is taken from 67 



Theorem 5.4.2. If R is a Boolean relation preserved by the minority operation, 
then R has a definition by a conjunction of linear equalities modulo 2. 

Proof. The proof is by induction on the arity k of R. The statement is clear 
when R is unary. Otherwise, let R be the boolean relation of arity k — 1 de- 
fined by Rq(x2, ■ ■ ■ , Xk) = R(0, X2, ■ ■ ■ , Xk), and let Ri C {0, l} fe_1 be defined by 
R\(x2, ■ • • , Xk) = P(lj X2, . . . , Xk)- By inductive assumption, there are conjunctions 
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of linear equalities ip and ipi defining R Q and Ri, respectively. If Rq is empty, 
we may express R(x\, . . . , x k ) by x\ = 1 A ipi. The case that R\ is empty can be 
treated analogously. When both R and R\ are non-empty, fix a tuple (c°, . . . , c°) £ 
i?o and a tuple (c^, . . . , € Define c° to be (0, c°, . . . , c ,) and c 1 to be 

(1,C2, . . . ,c°). Let b be an arbitrary tuple from {0, l} fc . Observe that if b £ R, then 
minority(b,c® ,c x ) £ i?, since c° € i? and c 1 G i?. Moreover, if minorityib, c° , c 1 ) € 
i?, then minority(minority(b,c a ,c l ),c° , c 1 ) = b £ R. Thus, b £ R if and only if 
minority (b,c° , c 1 ) G i?. Specializing this to &i = 1, we obtain 

(6 2 , . . . ,6 fe ) € i?i O (minority(b 2 ,c° 2 ,c\), minority(b k ,c k ,cl)) £ R . 

This implies 

£ R 4=> (minority(b 2 , csjfri, c 2 &i), . . . , minority(bk, c^bx, c k bi)) £ Rq . 
Thus, R(x\, . . . , Xfc) is defined by the formula 

3a;-(0 o (4 J --- J 4) A ( a; ' i + c?xi + c, 1 xi = a^)) . 

□ 

A binary relation is called bijunctive if it can be defined by a propositional formula 
that is a conjunction of clauses of size two (aka 2CNF formulas). 



Theorem 5.4.3 (of Post [169] and Schaefer |175| ). Let «8 be a structure over a 



two-element universe. Then either ({0, 1}, NAE) has a primitive positive definition 
in 53, and CSP(Q3) is NP-complete, or 

(1) 58 is preserved by a constant operation. 

(2) 03 is preserved by min. In this case, every relation o/Q3 has a definition by 
a propositional Horn formula. 

(3) 58 is preserved by max. In this case, every relation of 58 has a definition 
by a dual-Horn formula, that is, by a propositional formula in CNF where 
every clause contains at most one negative literal. 

(4) 58 is preserved by the majority operation. In this case, every relation of 58 
is bijunctive. 

(5) 58 is preserved by the minority operation. In this case, every relation of 58 
can be defined by a conjunction of linear equations modulo 2. 

In case (1) to case (5), CSP(58) can be solved in polynomial time. 

Proof. If the relation HNS = {(0, 0, 1), (0, 1, 0), (1, 0, 0)} has a primitive positive 
definition in 58, then Lemma 1.2.6| shows that the NP-hard problem positive l-in-3- 



3SAT |95| (see Example 1.2.21 can be reduced to CSP(58). In this case, also the 
relation NAE is primitive positive definable in 58, as we have seen in Example |1 .2. 7| 

If UN 3 is not primitive positive definable in 58, then by Theorem |5.2.3| there is 
a polymorphism / of 58 that violates R; let / be such an operation of minimal arity. 
Because R consists of three tuples only, Lemma |5. 3. 5| asserts that / is at most ternary. 

If / is not unary, then / : {0, 1} — > {0, 1} defined by x i— » f(x, . . . , x) must preserve 
HNS by the choice of /. Hence, / is the identity and / is idempotent. If / is binary, 
then by Lemma |5.4.1| it is either min or max. In case that / is min, we show that all 
relations in 58 can be defined by propositional Horn formulas. It is well-known that 
positive unit-resolution is a polynomial-time decision procedure for the satisfiability 



problem of propositional Horn-clauses 176 . The case that / is max is dual to this 
case. 

So let R be a Boolean relation preserved by min. Let <j) be a propositional formula 
in CNF that defines <j>. We can assume without loss of generality that for all literals 
in clauses of <fi, when we remove this literal from the clause, the resulting formula is 
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not equivalent to <f>. (Otherwise, we keep on removing literals until the formula has 
the required property.) Now suppose for contradiction that <j> contains a clause C 
with two positive literals x and y. Since <fi is reduced, there is an assignment s\ that 
satisfies 4> such that si(x) — 1, and such that all other literals of C evaluate to 0. 
Similarly, there is a satisfying assignment S2 for <f> such that S2{y) — 1 and all other 
literal s of C evaluate to 0. Then s$ : x i— > min{s\{x), S2(y)) does not satisfy C, and 
does not satisfy <f>, in contradiction to the assumption that min preserves R. 

If / is ternary, then / either generates the minority or the majority operation, 
by Lemma 5.4.1 and the choice of /. If / generates the majority operation, we show 
that every relation of 03 is bijunctive. Hence, in this case CSP(03) is equivalent to 
the 2SAT problem, and can be solved in linear time [7|. Let again be a reduced 
definition of a relation from R, and suppose that 4> contains a clause C with three 
literals l\, 1%, I3. Since <j> is reduced, there must be satisfying assignments si, S2, S3 to 
4> such that under Si all literals of C evaluate to except for li. Then the mapping 
So '• x 1 — y majority{s\{x) , S2{x), Sa(x)) does not satisfy C and therefore does not satisfy 
4>, in contradiction to the assumption that majority preserves R. 

If / generates the minority operation, and R is a relation of 03, then by Theo- 
rem [5A2] the relat ion R has a definition by a conjunction of linear equalities modulo 
2. Then CSP(03) can be solved in polynomial time by Gaussian elimination. 

Finally, if / is unary, then / is either constant, and we are done, or / is the 
operation -1 defined by x H> 1 — x. If / is ->, then NAE consists of three orbits 
of triples. If NAE is primitive positive definable in $, then CSP(03) is NP-hard by 
reduction from positive not-all-equal-3SAT [95] (see again Example |1.2.2 ). Otherwise, 
there is an at most ternary operation g that violates NAE. Since all non-constant 
unary operations preserve NAE, we can assume that g is at least binary. If g is 
binary and violates NAE, then there are t\, ti <E NAE such that to = g{t\, t^) ^ NAE. 
For i £ {1,2}, if ti G HNS, set cti to be id, otherwise set a, to be -1, and note that 
oiiti € HNS. Then either h: (x,y) t-> g(pt\x, a^y) or h: {x,y) M> ->g{axx, ct^y) violates 
HNS, and the statement follows from the above proof when we take h in place of /. 
The argument for ternary g follows the same lines. □ 



Hard Boolean constraint languages can be characterized in many equivalent ways 
via Corollary [5A4l To see this, we need the following proposition. 



Proposition 5.4.4. Let 03 be a structure over a two-element universe. Then the 
following are equivalent. 

(1) ({0, 1}, NAE) has a primitive positive definition in 03; 

(2) 03 is neither preserved by min, max, minority, majority, nor the constant 
operations. 

(3) The polymorphism clone of 03 either contains only projections, or is gener- 
ated by the unary operation x 1— > —x. 

(4) In 03 every first-order formula is equivalent to a primitive positive formula. 

Proof. The implication from (1) to (2) follows from the fact that NAE is not pre- 
served by min, max, minority, majority, and constant operations, which is straight- 



forward to verify. A proof that (2) implies (3) can for instance be found in 67 



(Theorem 5.1). The implication from (3) to (4) follows from Corollary 5.3.4 For 
the implication from (4) to (1), note that NAE is preserved by x 1— > —x, and hence 
preserved by all automorphisms of 03. In particular, NAE is first-order definable in 
03. So (4) implies that NAE also has a primitive positive definition in 03. □ 
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5.5. Pseudo-varieties and Primitive Positive Interpretations 

Primitive positive definability is a strong tool to prove that certain CSPs are 
hard, but in some cases this tool is not strong enough. In this section we discuss the 
concept of primitive positive interpretations, and the matching universal-algebraic 
concept, which is the concept of pseudo-varieties. These concepts suffice to perform 
all hardness proofs in this text purely algebraically. 

5.5.1. Algebras. Algebras have been defined in Chapter [2] they are simply 
structures with a purely operational signature. When A is an algebra with signature 
r and domain A, we denote by Clo(A) the set of all functions with domain A of the 
form (a?x, ■ ■ ■ , x n ) <-> t(xi, . . . , x n ) where t is any term over the signature r whose set 
of variables is contained in {xx, . . . , x n }; clearly, Clo(A) is closed under compositions, 
and contains the projections, and therefore forms a clone. In this section we recall 
some basic universal-algebraic facts that will be used in the following subsections. 

When JC is a class of algebras of the same signature, then 

• P(/C) denotes the class of all products of algebras from JC. 

• P (JC) denotes the class of all finite products of algebras from JC. 

• S(JC) denotes the class of all subalgebras of algebras from JC. 

• H(/C) denotes the class of all homomorphic images of algebras from JC. 

(Products, subalgebras, and homomorphic images have been defined in Chapter [2j) 
Note that closure under homomorphic images implies in particular closure under 
isomorphism. For the operators P, P fin , S and H we often omit the brackets when 
applying them to single algebras, i.e., we write H(A) instead of H({A}). The elements 
of HS(A) are also called the factors of A. 

A class V of algebras with the same signature t is called a pseudo-variety if V 
contains all homomorphic images, subalgebras, and direct products of algebras in 
V, i.e., H(V) = S(V) = P fin (V). The class V is called a variety if V also contains 
all (finite and infinite) products of algebras in V. So the only difference between 
pseudo-varieties and varieties is that pseudo-varieties need not be closed under direct 
products of infinite cardinality. The smallest pseudo-variety (variety) that contains 
an algebra A is called the pseudo-variety (variety) generated by A. 

Definition 5.5.1. Let 05 be a relational structure with domain B. An algebra 
with domain B whose operations are exactly the polymorphisms of 05 is called a poly- 
morphism algebra of 05. 

Note that a relational structure can have many different polymorphism algebras, 



since Definition 5.5.1 does not prescribe how to assign function symbols to the poly- 
morphisms of 05. In our applications, the precise choice of the signature never plays 
a role, and therefore we sometimes refer to the polymorphism algebra of 05, and de- 
note it by Alg(05). So statements about a polymorphism algebra of 05 (or about the 
polymorphism algebra Alg(05)) can typically be translated to statements that hold 



for all polymorphism algebras of 05 (e.g. in Theorem 5.5.6 below). 

Also note that when 05 is w-categorical, then the signature of the polymorphism 
algebra has cardinality 2 U . This follows directly from Theorem |7.3.2| and the remark 
after Lemma [3.1. 101 

Congruences and Quotients. When [i : C —> D is a map, then the kernel of 
[i is the equivalence relation E on C where (c, c') £ E if /i(c) = /i(c'). For c £ C, we 
denote by c/E the equivalence class of c in E, and by C/E the set of all equivalence 
classes of elements of C . A congruence of an algebra A is an equivalence relation 



that is preserved by all operations in A. The results in Section 5.2 show that for 
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w-categorical structures 03 with polymorphism algebra B, the congruences of B are 
exactly the primitive positive definable equivalence relations over 03. 



PROPOSITION 5.5.2 (see |63| ). Let A be an algebra. Then E is a congruence of 
A if and only if E is the kernel of a homomorphism from A to some other algebra B. 

When A is a r-algebra, and /i : A — > B is a mapping such that the kernel of is a 
congruence of A, we define the quotient algebra A//i of A under h to be the algebra 
with domain n(A) where 

f A ^(h(a 1 ),...,h(a k ))=^f A (a 1 ,...,a k )) 

where a±, . . . , a k G A and / G r is fc-ary. This is well-defined since the kernel of /i is 
preserved by all operations of A. Note that fi is a surjective homomorphism from A 
to A/fj,. The following is well-known. 

Lemma 5.5.3 (The Homomorphism Lemma). Let A be a r-algebra, let K be a 
congruence of A, and let [X\ : A — > B\ and /12 : A —¥ B2 be two mappings with kernel 
K. Then A//ii is isomorphic to A//i2- 

The following is also well known (see e.g. Theorem 6.3 in |63| ). 

Lemma 5.5.4. Let A and B be algebras with the same signature, and let fi : A — > 
IS be a homomorphism. Then the image of any subalgebra A' of A under h is a 
subalgebra o/B, and the preimage of any subalgebra B' o/B under /1 is a subalgebra 
of A. 

PROOF. Let / G r be fc-ary. Then for all a±, . . . , a k G A', 

/ B (M(a 1 ),...,/i(a fc )) = n(f A ( ai ,...,a k )) G h(A') , 

so n(A') is a subalgebra of C. Now suppose that fi(ai), . . . , fi(a k ) are in B'; then 
/ B (/i(ai), . . . ,n(a k )) G B' and hence n(f A {a u . . . , a k )) G B 1 . So, f A {a x , ■ ■ ■ , a k )) G 
fi^ 1 (B') which shows that ^i~ 1 (B') induces a subalgebra of A. □ 

5.5.2. Primitive Positive Interpretations. In Chapter |3j we have seen that 
first-order interpretations are a convenient tool to construct w-categorical structures 
from other w-categorical structures. Primitive positive interpretations are interpreta- 
tions I where the domain formula Si and all the defining formulas <f>j are primitive 
positive. As we will see, such interpretations can be used to study the computational 
complexity of constraint satisfaction problems. 

Definition 5.5.5. Let L be an interpretation. If the domain formula Si and 
the interpreting formulas 4>i are primitive positive, then we say that I is a primitive 
positive interpretation. 

We first start with a result that is known for finite domain constraint satisfaction, 



albeit not using the terminology of primitive positive interpretations 61 . In the 
present form, it appears first in [24) . 

Theorem 5.5.6. Let 03 and £ be structures with finite relational signatures. If 
there is a primitive positive interpretation of 03 in €, then there is a polynomial-time 
reduction from CSP(03) to CSP(C). 

Proof. Let d be the dimension of the primitive positive interpretation / of the 
r-structure 03 in the <7-structure £, let 5i(x±, . . . , x^) be the domain formula, let 
h: Si(€ d ) — > -D(03) be the coordinate map, and let </>i(xi, . . . , Xd k ) be the formula for 
the fc-ary relation R from 03. 
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Let 4> be an instance of CSP(Q3) with variable set U = {x\, 
an instance ip of CSP(£) as follows. For distinct variables V 
ipi to be the formula 

l<i<n 

Let ip2 be the conjunction of the formulas Oi(y] , . . . , yf , . . . , yj , . . . , yf ) over all 
conjuncts 6 — R(x^ , . . . , x; fc ) of (p. By moving existential quantifiers to the front, the 
sentence 

3y\,...,y d n (cf>i A 1P2) 

can be re- written to a primitive positive cr-formula ip, and clearly ip can be constructed 
in polynomial time in the size of 21. 

We claim that <p is true in 03 if and only ip is true in £. Let C be the domain of 
£, B the domain of 03, and suppose that /: U — > C satisfies all conjuncts of ip in £. 
Hence, by construction of ip, if <p has a conjunct 9 — R(xi t , . . . ,Xi k ), then 

£ |= Wfe • • • , fivfj), • • • , (/GO, • ■ ■ ' • 

By the definition of interpretations, this implies that 

•8 h= WGO, • • • ; • • • - Hfivl), /«))) • 

Hence, the mapping g: U — >• £? that sends to h(f(yj), . . . , f(y()) satisfies all con- 
juncts of in *B. 

Now, suppose that f:U^B satisfies all conjuncts of <f> over 03. Since h is 
a surjective mapping from 5(<L d ) to B, there are elements cj, . . . ,cf in £ such that 
h{c\, . . . , cf) = f(xi), for all i £ {1, . . . , n}. We claim that the mapping g : V — > C 
that maps y\ to is a homomorphism from ip to €. By construction, any constraint 
in ip either comes from ip\ or from ip2- If it comes from ip\ then it must be of the 
form Si(yj, . . . , yf), and is satisfied since the pre- image of h is <5/(£ d ). If the constraint 
comes from ip2, then it must be a conjunct of a formula 0i(yl i , . . . , yf x , . . . , y\ , . . . , yf ) 
that was introduced for a constraint 9 = R(xi i: . . . ,Xi k ) in 21. It therefore suffices to 
show that 

€^e I (g(yl 1 ),...,g(yi),...,g(yl),...,g(yf k )). 

By assumption, R(f(x il ), . . . ,f(x ik )) holds in 23. By the choice of c\, . . . ,c d , this 
shows that R(h(cj , . . . , cf ), . . . , h{c\ , . . . , cf )) holds in €. By the definition of in- 
terpretations, this is the case if and only if 0i(c} i ,cf, . . . , c\ , . . . , cf ) holds in £, 
which is what we had to show. □ 



. . . ,x n \. We construct 

: = {yh---,yt}, we set 



We describe how to compose interpretations, and observe that compositions of 
primitive positive interpretations are again primitive positive. Note that if £2 has a 
di-dimensional interpretation I\ in £1, and £3 has an (^-dimensional interpretation 
I2 in £2, then £ 3 has a natural ((^(^-dimensional interpretation in £ l5 which we 
denote by I2 ° I\ ■ To formally describe I2 ° I\ , suppose that the signature of £^ is r, 
for i = 1, 2, 3, and that I\ = (d\, S\, h\) and I2 = {d,2, S2, ^2)- When <p is a T2-formula, 
let <pi 1 denote the Ti-formula obtained from cp by replacing each atomic T2 formula ip in 
<p by the ri-formula ipi 1 . Note that when cp is primitive positive (existential positive), 
and the interpreting formulas of I\ are primitive positive (existential positive), then 
<pj 2 is again primitive positive (existential positive )FJ 



Note that this is in general false for existential formulas: there are existential formulas and 
existential interpretations I\ such that (f>j 1 is no longer existential. 
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Now the interpretation I 2 oIi is given by (did 2 , S, h) where S := (Sj 2 )i 1 ((£i) dld2 ), 
and where the coordinate map h : S — > £3 is defined by 

{a\,...,ai,...,a l e ,...,ai) >-> h 2 (hx(a{, . . . , af), . . . , h^a^, . . . , af)) . 

We have already observed the following. 

Lemma 5.5.7. When I± and I 2 are primitive positive interpretations, then I 2 ° I\ 
is also primitive positive. 

In many hardness proofs we use Theorem |5.5.6| in the following way. 

Corollary 5.5.8. Let 03 be an lo- categorical relational structure. If there is a 
primitive positive interpretation of ({0, 1}; 1IN3) or ({0, 1}; NAE) in 03, then 03 has 
a reduct with finite signature whose CSP is NP-hard. 

PROOF. The primitive positive formulas involved in the primitive positive inter- 
pretation can mention only finitely many relations from 03. Let 03' be the reduct of 
03 that contains exactly those relations. Then NP-hardness of CSP(03') follows from 
the NP-hardness of CSP( ({0,1}; 1IN3)) and C SP(({0, 1}; NAE)) (see Section [O] in 
Chapter [TJ Example [L2~2| ) via Theorem |5. 5. 6| □ 



We present an application of Theorem |5.5.6| and prove a hardness result that 
becomes useful at several occasions in later sections. 

Definition 5.5.9. For any set B , we write Eq for the relation 

{(x 1 ,x 2 ,yi,y 2 , 21,22) G B 6 | (xi = x 2 A yi ^ y 2 A z\ 7^ z 2 ) 

V (xi ^ x 2 A yi = y 2 A z\ ^ z 2 ) 

V (xi ^ x 2 A yi ^ y 2 A Z\ = z 2 )} . 

PROPOSITION 5.5.10. For any set B with \B\ > 2, the structure ({0, 1}; UN 3) 
has a primitive positive interpretation in (B;Eq), and CSP((B; Eq)) is NP-hard. 

Proof. Recall that 1IN3 denotes the boolean relation {(1, 0, 0), (0, 1, 0), (0, 0, 1)}. 
We give a primitive positive interpretation / of the structure 53 := ({0, 1}; 1IN3) in 
(B;Eg). The dimension of / is 2, and the domain formula is Si :— T (for true). 
The formula lIN3{x\, x 2 , y%, y 2) Z\, z 2 )j is E§(xi, x 2 , yi, y 2 , z\, z 2 ), and the formula 
=/ (xi,x 2 ,y 1 ,y 2 ) is 

3a 1 ,a 2 ,u 1 ,u 2 ,u 3 ,u 4 ,z 1 ,z 2 (01 = a 2 A E^ (a 1 ,a 2 ,u 1 ,u 2 ,u 3 ,u 4 ) 

A Ef(ui,u 2 ,xi,x 2 ,zi,z 2 ) A E%{u3,Ui,zx,z 2 ,yi,y 2 )) . 

Note that the primitive positive formula =/ (xi, x 2 ,yi, y 2 ) is equivalent to x\ = x 2 
2/i = J/2- The map h maps (61,62) G B 2 to 1 if b 4 = b 2 , a nd to otherwise. NP- 



hardness of CSP(( J B; Eg)) then follows from Corollary 5.5.8 □ 



Corollary 5.5.11. Let 03 be an u>- categorical structure where all first- order for- 
mulas are equivalent to primitive positive formulas ( or that satisfies some of the other 



equivalent conditions from Corollary 5.3.4). Then there is a primitive positive in 



terpretation of ({0, 1}; 1IN3) in 03 ; and 03 has a finite signature reduct 03' such that 
CSP(03') is NP-hard. 

Proof. Since Ef is first-order definable, it also has a primitive positive definition 



in 03 by assumption. Proposition 5.5.10 implies that the structure ({0, 1}; 1IN3) has 



a primitive positive interpretation in 03. The last part of the statement follows from 



Corollary 5.5.8 □ 
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In fact, we could have weakened the assumptions in Corollary |5 . 5 . 1 1| by only re- 
quiring that all polymorphisms of 03 are essentially unary, and that all endomorphisms 
of 03 are injective, because it is then easy to see that the relation Eg is preserved by all 
polymorphisms of 03, and hence primitive positive definable in 03, by Theorem |5. 2. 3 



There are many situations where Theorem 5.5.6 can be combined with Lemma 1.2.8 



to prove hardness of CSPs, as described in the following. 

Proposition 5.5.12. Let 21 be a structure with finite relational signature, and let 
03 be a structure with elements c\ , . . . , Ck such that 

• the orbit of (ci, . . . , c&) in 03 is primitive positive definable, and 

• 21 has a primitive positive interpretation in (03, Ci, . . . , C&). 

Then there is a finite signature reduct 03' of 03 and a polynomial-time reduction from 
CSP(2t) to CSP(03'). 

Proof. Let £ denote the expansion of 03 by the unary relations {ci}, . . . , {ck}- 
Then the interpretation of 21 in (03, c±, . . . , Ck) shows that there is also a primitive 
positive interpretation of 21 in £, and this interpretation mentions only finitely many 
relations of (£. Let <£.' be the finite signature reduct of £ that contains exactly those 
relations and the relations that are mentioned in the primitive positive definition of the 
orbit of (ci, ... ,Cfc). Since C still interprets 21, there is a polynomial-time reduction 



from CSP(2l) to CSP(C) by Theorem 5.5.6 Since there is still a primitive positive 



definition of the orbit of (ci, . . . , Ck) in we can apply Corollary 3.6.26 and get a 
polynomial-time reduction from CSP(C) to CSP(03'), where 03' is the reduct of 03 
that only contains the relations that are also in note that 03' has finite signature. 
Composing reductions, we conclude that there is a polynomial-time reduction from 
CSP(2t) to CSP(03'). □ 

Together with Corollary |3. 6. 26| we have the following consequence. 

Corollary 5.5.13. Let 03 be an u>- categorical relational structure, and let 21 be 
a structure with a finite signature and a hard CSP. // there is a primitive positive 
interpretation of$l in the expansion of the model- complete core o/03 by finitely many 
constants, then 03 has a reduct with finite signature whose CSP is NP-hard. 



We give an application of this technique in Proposition 



applications can be found in Section |9.2| and Section |10.2.3 



5.5.14 below. Many more 



Proposition 5.5.14. The structure ({0, 1}; NAE) has a primitive positive inter- 
pretation in (Q; Betw, 0), and CSP ((Q; Betw)) is NP-hard. 

PROOF. Recall that the relation NAE is {0, l} 3 \ {(0, 0, 0), (1, 1, 1)}. The dimen- 
sion of our interpretation / is one, and the domain formula is 3z. Betw(x, 0, z), which 
is equivalent to i ^ 0. The formula —j {xi, yi) is 

3z ( y Betw{x\, 0, z) A Betw(z, 0, yi)) . 

Note that =/ is over (Q; Betw, 0) equivalent to {x\ > y > 0). Finally, the formula 
NAE{x 1 ,y\,z\)i is 

3u i y Betw{x\, u, yi) A Betw(u, 0, zi)) . 

The map h maps positive points to 1, and all other points from Q to 0. 

Since the orbit of is the entire set Q it is in particular primitive po sitive d efinable, 
and we can show NP-hardness of CSP ((Q; Betw)^J using Proposition 



5.5.12 



and the 



fact that CSP (({0, 1}; NAE)) is NP-hard. □ 
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5.5.3. Pseudo-varieties. We present the mentioned connection between prim- 
itive positive interpretations and pscudo- varieties. 



Theorem 5.5.15 (from [24]). Let £ be a finite or uj -categorical relational struc- 
ture, and let C be a polymorphism algebra of £. Then a structure 03 has a primitive 
positive interpretation in £ if and only if there is an algebra B in the pseudo-variety 
generated by C such that all operations of B are polymorphisms of%$. 

PROOF. Let r be the signature of C, and let V be the pseudo-variety generated 
by C. Similarly to the famous HSP theorem for varieties (see e.g. [63] ), every algebra 
in V is the homomorphic image of a subalgebra of a finite direct product of C. To 
see this, we have to verify that HSP fin (C) is closed under H, S, and P fin . It is 



clear that H(HSP fin (C)) = HSP fin (C). Lemma |5.5.4] implies that S(HSP fin (C)) C 
HSSP fin (C) = HSP fin (C). Finally, 

pfi"(HSP fin (C)) CHP fin SP fin (C) CHSP fin P fin (C) =HSP fin (C) . 

First assume that there is an algebra B in V all of whose operations are polymor- 
phisms of 03. Then there exists a finite number d > 1, a subalgebra D of C d , and a 
surjective homomorphism h from D to B. We claim that 03 has a first-order inter- 
pretation / of dimension d in £. All operations of C preserv e D (v iewed as a d-ary 



relation over £), since D is a subalgebra of C d . By Theorem 5.2.3 this implies that 
D has a primitive positive definition 8{x\, . . . ,Xd) in £, which becomes the domain 
formula 5i of /. As coordinate map we choose the mapping h. 
If R is a fc-ary relation in 03, let R' C C dk be defined by 

(al,...,af,...,al,...,ai) G R' & {h{a{, . . . ,af), . . . ,h(al, . . . ,oj£)) G R . 

Let / G r be arbitrary. By assumption, / B preserves R. It is easy to verify that 
then f c preserves R' . Hence, all polymorphisms of £ preserve R' , and because £ is 



w-categorical, the relation R' has a primitive positive definition in £ (Theorem 5.2.3), 
which becomes the defining formula for R(x±, . . . , Xf.) in I. Finally, since h is an alge- 
bra homomorphism, the kernel K of h is a congruence of D. It follows that K, viewed 



as a 2<i-ary relation over C, is preserved by all operations from C. Theorem 5.2.3 
implies that K has a primitive positive definition in £. This definition becomes the 
formula =/. It is straightforward to verify that / is a primitive positive interpretation 
of 23 in t. 

To prove the opposite direction, suppose that 03 has a primitive positive interpre- 
tation / in £. We have to show that V contains a r-algebra B such that all operations 
in B are polymorphisms of 03. Let d be the dimension and 5 be the primitive positive 
domain formula of /. Clearly, the set 8{€. d ) is preserved by all operations in C, and 
therefore induces a subalgebra D of C d . 

We first show that the kernel K of the coordinate map h of the interpretation is a 
congruence of D. For all c?-tuples a,b £ D, the 2d-tuple (a, b) satisfies =/ in £ if and 
only if h(a) = h(b). Let S be the 2d-ary relation defined by =/ over £. Then S can be 
viewed as a binary relation over C d , and we have S (1 D 2 = K. Since =/ is primitive 
positive definable in £, S is preserved by all polymorphisms of £. To show that K 
is a congruence of D, let / € r be fc-ary, and let (a 1 , b 1 ), . . . , (a k , b k ) be pairs from 
K. Let a = / D (a\ . . . , a k ) and b = / D (6 1 , . . . , b k ). We have to show that (a, b) G K. 
Since D is a subalgebra of C d , a,b G D, and hence it suffices to show that (a, b) G S. 
Recall that / D is defined by applying f c component-wise. Since {a 1 ,b l ) G S for all 
i < k and f c preserves S, we thus have that (a, b) G S. Hence, K is a congruence of 
D and ft, is a surjective homomorphism from D to B := D/h. 

We finally verify that every operation in B is a polymorphism of 03, i.e., for 
every / G r, every relation R of 03 is preserved by / B . The operation / B preserves 
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4> '■= R(xi, . . . , Xk) if and only if / c preserves </>/. Since / is a polymorphism of C, 
and since </>/ is primitive positive over £, the operation f c indeed preserves </>/. □ 



The proof of Theorem |5.5.15| above gives more information about the link between 
the algebras in HSP fin (Alg(T)) and the primitive positive interpretations in T, and 
we state it explicitly. 

Theorem 5.5.16. Let € be a finite or uj -categorical structure, and let 03 be an 
arbitrary structure. Then the following are equivalent. 

(1) there is a polymorphism algebra C o/C, an algebra S € 8(0^) with domain 
S, and a surjective homomorphism h from S to an algebra B such that 
Clo(B) C Pol(Q3); 

(2) 03 has the primitive positive interpretation (d, S, h) in (£. 

We return to applications of these concepts to CSPs. 

Corollary 5.5.17. Let 03 be lo-categorical. If there is an expansion £ of the 
model- complete core of 03 by finitely many constants such that the pseudo- variety V 
generated by Alg(£) contains a 2-element algebra where all operations are projections, 
then 03 has a finite signature reduct with an NP-hard CSP. 

Proof. Let D be the 2-element algebra in V where all operations are projections. 



All operations of D preserve the relation 1IN3. By Theorem 5.5.15 the structure 



({0, 1}; 1IN3) has a primitive positive interpretation in C Then Corollary 5.5.8 shows 



that 03 has a finite signature reduct 03' with an NP-hard CSP. □ 

All w-categorical templates known to the author that have an NP-complete CSP 
satisfy the condition from Corollary |5. 5. 17| For finite templates 03 there is the con- 
jecture (and strong evidence) that CSP(03) is NP-hard if and only if 03 satisfies this 



condition (see Section 5.6.1) 



Theorem 5.5.18. Let 03 be any structure. Then the following are equivalent. 

(1) there is a primitive positive interpretation of ({0, 1}; 1IN3) in 03. 

(2) there is a primitive positive interpretation of ({0, 1}; NAE) in 03; 

(3) 03 interprets a structure with at least two elements where all first-order for- 
mulas are equivalent to primitive positive formulas; 

(4) all finite structures have a primitive positive interpretation in 03. 

// 03 is uj-categorical, the following two conditions are equivalent to the conditions 
above. 

(5) the pseudo-variety V generated by Alg(03) contains for all n a finite algebra 
on n elements all of whose operations are projections; 

(6) the pseudo-variety V generated by Alg(03) contains a 2-element algebra all 
of whose operations are projections. 

Proof. The first statement can be shown by proving implications in cyclic order, 
(1) (2) (3) => (4) =$> (1). Obviously, (4) implies (1). We have given a primitive 



positive definition of NAE in ({0, 1}; 1IN3) in the proof of Theorem 5.4.3 which 
implies that (1) implies (2). The implication from (2) to (3) is by Proposition 5.4.4 

For the implication ^ =>■ Q, let 03' be the structure that has a primitive positive 
interpretation in 03, has at least two elements, and where all first-order formulas 
are equivalent to primitive positive formulas. Let 21 be a r-structure with domain 
{1, . . . , n}. We prove that 21 has a first-order interpretation in 03'. This yields in fact 
a primitive positive interpretation since every first-order formula is equivalent to a 
primitive positive formula in 03'. The claim then follows by composing the primitive 



positive interpretation of 03' in 03 with that of 21 in 03', via Lemma 5.5.7 
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Our first-order interpretation / of 21 in 03' is 2n-dimcnsional. The domain formula 
8i{x\, . . . , x n , x'i, . . . , x' n ) expresses that for exactly one i < nwe have Xi = x\\ clearly 
this is first-order. Equality is interpreted by the formula 

n 

=i (xi, . . . ,x n ,x[,. . . ,x' n ,yi,. . . ,y n ,y[, . . . ,y' n ) = ( f\(x l = a;-) (y % = y-)) ■ 

»=i 

Note that the equivalence relation defined by =j on <5((03') 2 ™) has exactly n equiva- 
lence classes, and the coordinate map sends (xx, . . . , x n , x±, . . . , x' n ) to i if and only 
if Xi = x\. It is now straightforward to write down first-order formulas </>/ that in- 
terpret atomic r-formulas <j). When R £ r is fc-ary, then the formula R(x\, . . . , Xk)i 
is a disjunction of conjunctions with the 2nk variables Xx t i, . . . , x±_ n , x[ 1: . . . , x[ n , 
. . . , Xk,i, ■ ■ ■ , Xk, n , x 'k i) ■ ■ ■ i x 'k n- ^ or eacrl tuple (t\,..., tk) from i? a , the disjunction 
contains the conjunct [\ i<k %i,ti — x' it .. 

Now suppose that 03 is w-categorical. We prove that Q implies ([5]), that ^ 
implies pj, and that ([6]) implies (fl]). For Q =^ let 21 be the structure with domain 
A = {1, . . . ,n}, the relations P^%and for each i £ n the unary relation {i}. By Q 
there is a primitive positive interpretation of 21 in 03. Hence, Theorem |5 . 5 . 1 5 1 implies 
that there is an algebra A' £ V such that all operations of A' are polymorphisms 



of 21. But all polymorphisms of 21 are projections (Corollary 5.3.4). The implication 
(j5J) ^ is trivial. The implication |6]) follows from Theorem 5.5.15 and the 



fact that the projections preserve 1IN3. □ 

5.5.4. Bi-interpretations and Classification Transfer. Let £be a structure 
with finite relational signature. By the classification project for £ we mean a com- 
plexity classification for CSP(03) for all first-order expansions 03 of £ that have finite 
relational signature. For instance, the classification project for the random graph 
(Y;E) is treated in Chapter [9j and the classification project for (Q; <) is treated in 
Chapter [10] 

Sometimes, it is possible to derive the complexity classification project for £ from 
the complexity classification project for ID, for another w-categorical structure D. For 
instance, we will show below how to derive the classification project for the directed 
graph 

C:= (N 2 ;{(x,y),(u,v) \y = u}) 
from the classification project for D := (N; =) (which will be given in Chapter [6]); 
a more advanced application of such a classification transfer can be found in Theo- 
rem 15.5.241 below. 

Primitive positive interpretability is a crucial concept for the transfer of complex- 
ity classifications. In particular, this section studies primitive positive bi-interpretations 
in this context. Two interpretations of £ in J) with coordinate maps hi and h 2 are 
called homotopi^ if the relation {(x,y) | hx(x) — ti2(y)} is first-order definable in 
D. If this relation is even primitive positive definable in D, we say that the two in- 
terpretations are pp-homotopic. The identity interpretation of a r-structure £ is the 
interpretation / = (l,true, h) of £ in £ whose coordinate map h is the identity (note 
that the identity interpretation is primitive positive). Recall that we write I\ o I 2 for 
the natural composition of two interpretations I\ and I 2 , defined in Section [5.5. 2| 

Definition 5.5.19. Two structures £ and D with an interpretation I of £ in T) 
and an interpretation J offtin'D are called mutually interpretable. If both I o J and 
Jo I are homotopic to the identity interpretation (of D and of ' £, respectively), then 
we say that £ and 2) are bi-interpretable. 



^We are following the terminology from [sj. 



no 
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When both interpretations I and J are primitive positive, then £ and 2) are called 
mutually pp-interpretable. If moreover IoJ and Jol are pp-homotopic to the identity 
interpretation, then £ and 2) are called primitive positive bi-interpretable. 

Example 5.5.20. The directed graph £ := (N 2 ;M) where 

M := {((iti,u 2 ), (vi,v 2 )) | m 2 = vi} 

and the structure 2) := (N; =) are primitive positive bi-interpretable. The interpre- 
tation / of £ in 2) is 2-dimcnsional, the domain formula is true, and the coordinate 
map h is the identity. The interpretation J of 2) in £ is 1-dimensional, the domain 
formula is true, and the coordinate map g sends {x, y) to x. Both interpretations are 
clearly primitive positive. 

Then g(h(x,y)) = z is definable by the formula x — z, and hence I o J is pp- 
homotopic to the identity interpretation of 2). Moreover, h(g(u), g(v)) = w is primi- 
tive positive definable by 

M(w,v) A 3p(M(u,p) A M(w,p)) , 
so J o 7 is also pp-homotopic to the identity interpretation of < <f . □ 

Example 5.5.21. The structures £ :— (N 2 ; {(x, y), (u, v) | x = u}) and D := 
(N; =) are mutually primitive positive interpretable, but not primitive positive bi- 
interpretable. There is a primitive positive interpretation 1\ of 2) in £, and a primitive 
positive interpretation of £ in 2) such that I2 ° I\ is pp-homotopic to the identity 
interpretation. However, the two structures are not even first-order bi-interpretable, 



as we will see in Example 7.4.2 in Section 7.4 □ 



Here comes the central lemma for complexity classification transfer. 

Lemma 5.5.22. Suppose 2) has a primitive positive interpretation I in £, and £ 
has a primitive positive interpretation J in 2) such that J o I is pp-homotopic to the 
identity interpretation of £. Then for every first-order expansion £' of £ there is a 
first-order expansion 2)' of 2) such that £' and 2)' are mutually pp-interpretable. 

Proof. Let / — (c,U,g) and J = (d,V,h) be the primitive positive interpreta- 
tions from the statement, and let £' be a first-order expansion of £. Then we set 2)' 
to be the expansion of 2) that contains for every fc-ary R in the signature of £' the 
(c?A:)-ary relation S defined as follows. When 4> is the first-order definition of R in £, 



then S is the relation defined by cf>j in 2) (see Section 5.5.2 1. 

We claim that £' has the primitive positive interpretation (d, V, h) in 2)'. First 
note that V is primitive positive definable in 2)' since 2)' is an expansion of 2). 
An atomic formula ip with free variables x±,...,Xk in the signature of £' can be 
interpreted in 2)' as follows. We replace the relation symbol in ip by its definition in 
£, and obtain a formula </> in the language of £. Let S be the symbol in the language 
of 2)' for the relation defined by <t>j(x{, . . . , xf , . . . , x\, . . . , xf) over 2)'. Then indeed 
S(x\, . . . , xf, . . . , x\, . . . , xf) is a defining formula for ip, because 

£' h i>(h(al, . ... of), ... , g{a\, . . . , a d k )) & 2)' |= S{a\, . . . , < . . . , 4, . . . , aft 

for all <2i, . . . , afc G V. 

Conversely, we claim that 2)' has the primitive positive interpretation (c, U, g) 
in £'. Again, U is primitive positive definable in £' since £' is an expansion of 
£. Let cj) be an atomic formula in the (relational) signature of 2)'. If the rela- 
tion symbol in ip is already in the signature of 2), then there is a primitive posi- 
tive interpreting formula in £ and therefore also in £'. Otherwise, by definition of 
2)', the relation symbol in <fi has arity dk, and has been introduced for a fc-ary re- 
lation R from £'. We have to find a defining formula having kcd variables. Let 
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0(xo, x± i, . . . , x± >c , . . . , Xd,i, ■ ■ ■ , x c .d) be the primitive positive formula of arity cd+ I 
that shows that h(g(xi t i, . . . , x Ct i), . . . , g(xi t d, ■ ■ ■ , x c ,d)) — xq is primitive positive de- 
finable in C Then the defining formula for the atomic formula 4>(x\, . . . , x^j has free 
variables x\ l5 . . . , x\ d and equals 

k 

3a; 1 , ... , x k (R(x\ . . . , x k ) A f\ 9(x\ x\ A , . . . , . 

i=i 

□ 



In particular, when £, D, £' and D' are as in Lemma [5. 5.22 and £' and S' have a 



finite relational signature, then CSP(03) and CSP(OS') have the same computational 
complexity, by Theorem 5.5.6| Hence, Lemma |5.5.22| shows that the classification 



project for £ can be reduced to the classification project for £>. With a slightly 
stronger assumption we can get the following consequence. 

Corollary 5.5.23. Let € and® be primitive positive bi-interpretable u -categorical 
structures. Then every structure with a first-order definition in £ is primitive positive 
bi-interpretable with a structure that has a first-order definition in 2). 



More about first-order bi-interpretability can be found in Section 7.4 Let us 
conclude with a concrete application of Corollary |5.5.23| 



Theorem 5.5.24. Let 03 be a reduct of Allen's interval algebra ( Example \3.1.11) 



that contains the relation m = {((711,1*2), (fi,^)) |u2 = i>i}. Then CSP(Q3) is either 
in P or NP-complete. 

Proof. We show that the structure (I;m) is primitive positive bi-interpretable 
with (Q; <). It follows that 03 is primitive positive bi-interpretable with a temporal 



constraint language 03', and the result follows by the main result of Chapter 10 and 
Corollary [5X231 

Let / be the 2-dimensional interpretation of (I; m) in (Q; <) with domain formula 
x < y, the formula {y\ = 2/2)/ is true, and the formula (m(yi, 2/2))/ has variables 
x\,x\,x\,x\ and is given by x\ — x\. The coordinate map g sends (x,y) £ Q 2 with 
x < y to the interval [x,y] E I. 

Let J be the 1-dimensional interpretation with domain formula true, and where 
the coordinate map h is [x, y] > x. The formula (x < y)i is the primitive positive 
formula 

3u, v (m(u, xi) A m(u, v) A m(v, X2)) ■ 
We show that Jo I and Jo I are pp-homotopic to the identity interpretation. The 
relation {(xi,X2,y) \ h{g{x\, x^)) = y} has the primitive positive definition x\ = y. 
To see that the relation R := {(it, v,w) \ g(h(u), h(v)) — iu} has a primitive positive 
definition in (I; to), first note that the relation 

{(u,v) \u = [ui,u 2 ],v = [vi,v 2 ],ui =v%} 

has the primitive positive definition cf>i(u,v) = 3w (m(w,ui) A m(w, u 2 )) in (I; to). 
Similarly, {(u, v) \ u — [ui, 112], v = [v±, V2], U2 — ^2} has a primitive positive definition 
<p2(u,v). Then the formula <f>i(u,w) A 4>2(v,w) is equivalent to a primitive positive 
formula over (I;m), and defines R. □ 

5.6. Varieties 



Varieties (which we have introduced briefly in Section 5.5) are a fascinatingly 
powerful tool to study classes of algebras. The complexity of CSP(03) for a finite core 
structure 03 only depends on the variety generated by the polymorphism algebra of 
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58. This comes from the fact that a finite algebra is in the variety generated by a 
finite idcmpotant algebra B if and only if it is in the pseudo- variety generated by B; 
the link between the pseudo-variety generated by B and the CSP has already been 



explained in Section 5.5 



The section has two parts. In Section 5.6.1 we explain the role of varieties for 
the study of CSPs with finite templates. In particular, we present various equivalent 
forms of the tractability conjecture for finite domain constraint satisfaction. 

The fact that for finite cores 58 the computational complexity of CSP(58) can be 
captured by the variety generated by the polymorphism algebra of 58 does not gen- 
eralize easily to w-categorical structures 58. But it will turn out that the tractability 
frontier in the classification results in Chapter [9] and Chapter [lO] can be described 
elegantly using varieties, and the description is very similar to the tractability conjec- 
ture for finite domain constraint satisfaction. In Section [5.6.2| we provide some partial 
explanation to this phenomenon. 

5.6.1. The Tractability Conjecture. In this section, we present some clas- 
sical results that specifically hold for finite algebras and are relevant to constraint 
satisfaction. We also discuss more recent universal-algebraic results about finite alge- 
bras. We cannot fully cover all recent developments here, but sketch in this section 
some of the highlights. 

We have already mentioned in the introduction the dichotomy conjecture of Feder 



and Vardi 89 , which we state here since it is one of the central stimulating conjectures 



for finite domain constraint satisfaction. 

Conjecture 5.1 (Dichotomy Conjecture [89] ) . Let 58 be a structure with finite 
relational signature and finite domain. Then CSP(58) is in P or NP-complete. 



We will now see a stronger conjecture, due to |60| 61 , that exactly describes 



which finite-domain CSPs are NP-hard, and which can be solved in polynomial time. 
This conjecture is called the tractability conjecture, and it has been confirmed in many 
important cases, for example for 

• finite structures 58 that contain a unary relation symbol for each subset of 



the domain of 58, due to 56 (see also 



structures over a 3-element domain 58 



and 



• digraphs without sources and sinks 14 , and which includes the case of 
undirected graphs. 

The tractability conjecture can be formulated in terms of primitive positive in- 



terpretability (Section 5.5) as follows 



Conjecture 5.2 (Tractability Conjecture). Let 58 be a finite structure with finite 
relational signature, and let £ be the expansion of the core of 58 by a constant symbol 
for each element b of 58. Then CSP(58) is NP-hard if there is a primitive positive 
interpretation of ({0, 1}; 1IN3) in £, and can be solved in polynomial-time otherwise. 

We remark that the first part of this conjecture follows directly from Corol- 



lary 5.5.13 Moreover, we have also seen that CSP(58) is polynomial-time equivalent 
to CSP(£), so all that remains to be shown is to prove polynomial-time tractability of 
CSP(£) when £ does not admit a primitive positive interpretation of a hard Boolean 
CSP. By the results in Section [B~5"l this condition can be translated into a condition 
of the pseudo- variety generated by the polymorphism algebra C of (£. 

When B is a finite and idempotent algebra, it turns out that a finitely generated 
algebra (and in particular a finite algebra) A is in the pseudo-variety generated by 



B if and only if A is in the variety generated by B (see 63 ; the claim follows from 
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Exercise 11.5 in combination with the proof of Lemma 11.8 there). Varieties have the 
advantage that they can be described by the equations satisfied by its members. 



Theorem 5.6.1 (Birkhoff; see e.g. 113 or |63|). Let r be a functional signature 



A a t -algebra, and C be a class of T-algebras. Then the following are equivalent. 

• All universal conjunctive sentences that hold in all members of C also hold 
in A. 

• A is in the variety generated by C . 

• AG HSP(C). 



Theorem 5.6.1| is important for constraint satisfaction since it can be used to 



transform the 'negative' statement of not interpreting a hard boolean CSP into a 
'positive' statement of having polymorphisms satisfying non-trivial identities. The 
following theorem is an application of this philosophy, and goes back to Walter Taylor 
(Corollary 5.3 in 185 ; see also Lemma 9.4 in 112] ). 



Theorem 5.6.2. Let 03 be a finite structure, and suppose that the polymorphism 
algebra B of 03 is idempotent. Then the following are equivalent. 

• ({0, 1}; 1IN3) does not have a primitive positive interpretation in 03. 

• every 2- element algebra in the pseudo-variety generated by B contains an 
essential operation. 

• every 2-element algebra in HSP(B) contains an essential operation. 

• B has a Taylor term, that is, an n-ary operation, for n > 2, such that for 
every 1 < i < n there are Xx, . . . ,x n , y\, . . . ,y n € {x, y} such that (B; f) 
satisfies 

Vx,y. f(xx, ■ ■ ■ 

=f(vi> ■ ■ ■ ,yi-i,y,Ui+i, ■ ■ -jj/n) • 



Even though Theorem |5.6.2| is of central importance in universal algebra, it was 
discovered only recently and under the influence of work in the context of constraint 
satisfaction that the existence of Taylor terms is equivalent to the existence of various 
other terms satisfying stronger conditions, which are likely to be of greater use in the 
quest for polynomial-time algorithms. 

Theorem 5.6.3. Let B be a finite idempotent algebra. Then the following are 
equivalent. 

• B has a Taylor term. 

• B has a weak near unanimity, that is, an n-ary operation f , for n > 2, that 
satisfies 

Vx, y. f(x, ...,x,y) = f(x, . . . , y, x) = ■ ■ ■ = f(y, x,...,x) . 

• B has a Siggers terrrj^j that is, a four-ary operation f that satisfies 

Vie, y. f(y, y, x, x) = f(x, x, x, y) = f(y, x, y, x) . 

• B has a cyclic term, i.e., an n-ary operation f , for n>2, that satisfies 

Vxi,...,x n . f(xi,...,x n ) = f(x 2 ,...,x n ,xx) ■ 



^Originally, Siggers gave equations for a six-ary operation, using the universal-algebraic formu- 
lation from [57] of the dichotomy theorem for the CSPs for undirected graphs Sj from |107] . This 
was later improved by an anonymous referee of [180| to the given identities for a 4-ary operation, 



using the main result from 14 ; see concluding comments in 180 
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Note that weak near unanimities, Siggers terms, and cyclic terms are (special) 
Taylor terms, and that cyclic terms are (special) weak near unanimities. Also note 
that the existence of a Siggers term can be decided (for an explicitly given finite 
idempotent algebra B), and hence the condition of the tractability conjecture for 
finite domain constraint satisfaction is decidable. We would also like to remark that 
binary commutative operations, that is, operations / satisfying f(x,y) — f(y,x), are 
Taylor terms, and that Conjecture |5.2| is already open in this special case. 

Another improvement of Theorem |5.6.2| is the observation that when the variety 
generated by a finite idempotent algebra 03 contains a 2-element algebra without 
essential operations, then this algebra is already contained in IIS (03) (see Proposition 



4.14 in 60 ) 



Theorem 5.6.4 (of [60]). Let B be a finite idempotent algebra. Then HSP(B) 
contains an algebra without essential operations if and only i/HS(B) does. 

Since all algebras in HS(B) are smaller than B (or isomorphic to B), this gives 
another access to an algorithm that decides whether a given structure 03 satisfies the 



equivalent conditions in Theorem 5.6.2 (besides the approach via searching for Siggers 
terms mentioned above). 



5.6.2. Canonical Operations. We cannot offer an full analog of Theorem 5.6.2 
for w-categorical structures 03. This section treats the special case where the polymor- 
phism algebra of an w-categorical structure 03 resembles a finite algebra in a certain 
formal sense; in this case, an analog of Theorem |5.6.2| can be transferred from the 
finite. 

Let 03 be a structure. Then / : B k — > B is called m-canonical (with respect to 
53) if for all m-tuples t\, . . . the m-type of f(t±, . . . , tfc) in 03 only depends on the 
to- types of t\, . . . ,tk in 03. It is called canonical if it is m-canonical for all finite m. 

Lemma 5.6.5. Let 03 be a structure with a finite number q of m-types, and let B be 
an algebra with signature r such that all operations o/B are m-canonical with respect 
to 03. Then there exists a r -algebra A of size q and a surjective homomorphism \x 
from B m to A. 

PROOF. Let p x ,...,p q be the m-types of 03. Define \x: B' m — > {l,...,q} by 
g(bi , . . . , b m ) = i if (&!,... , b m ) has type p^. Since the operations of B are m-canonical, 
the kernel of /j, is a congruence K of B m . Then A := B m /A" satisfies the requirements 
of the statement. □ 

The algebra A constructed from B in the proof of the previous lemma will be 
called the type algebra of B, denoted by T m (B). 

Lemma 5.6.6. Let 03 be an u>- categorical model- complete core with a finite number 
of m-types, and suppose that all operations of the polymorphism-algebra B of 03 are 
m-canonical with respect to 03. Then T m (B) is idempotent. 

Proof. When 03 is an w-categorical model-complete core, then all orbits of m- 
tuples in 03 are preserved by the endomorphisms of 03. It follows that every operation 
/ of T m (B) satisfies Wx. f(x, . . . , x) = x. □ 

Note that if 03 is homogeneous in a relational signature with maximal arity m 
(or first-order interdefinable with such a structure), then being m-canonical implies 
being n-canonical for all n > m. In this case, we simply write T(B) instead of T m (B). 

Lemma 5.6.7. Let 03 be first- order definable in a homogeneous structure £ having 
finite signature, and suppose that all the operations of the polymorphism algebra B of 
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05 are canonical with respect to £. Then there is an operation f in T(B) such that 
for some i\, . . . , i n < n 

T(B) |= VS. f(xi, ...,x n ) = /On,- • • ,Xi n ), 

if and only if there is an n-ary g E Pol(05) and an a € Aut(£) such that 

B |= VS. g(xi, . . . , x„) = ag(x H , . . . , x in ) . 

Proof. Let m be the maximal arity of £, and let q be the cardinality of T(B) = 
T m (B). Let fi be the surjective algebra homomorphism from B m to T(B) from 
Lemma l5.6.5l 

To prove the backwards implication of the statement, let / E Pol(05) and a E 
Aut(£) be such that B |= \/x. g(xi, ■ ■ ■ ,x n ) — ag{xi 1 , . . . ,Xi n ). All universally quanti- 
fied atomic formulas satisfied in B also hold in products and homomorphic images (the 
easy direction of Theorem 5.6.1), and therefore VS. g(x±, . . . , x n ) — ag(xi 1} . . . ,Xi n ) 
is satisfied also by T(B). Since a is an automorphism of £, it preserves all m-types 
in £, and thus a T ^ is the identity. We conclude that T(B) |= Vx. g(x\, . . . , x n ) = 
g(xi ± , . . . ,Xi n ), as required. 

For the forwards direction, we claim that for all finite subsets A = {(a\, . . . , a^), 
. . . , (a*, . . . , a„)} of *B™ there exists an automorphism j3 of £ such that 

f*(a{,...,ai) = Pf li (ai,...,ai) (8) 

holds for all 1 < j < k. We write for (a{, . . . , a J n ). Let h be the operation defined 
by h(xi, . . . ,x n ) — f{xi ± , . . . ,Xi n ). We prove that the tuple t := (/ B (a 1 ), . . . , / B (a fe )) 
has the same type in €. as the tuple s := (/^(a 1 ), . . . , /i B (a fc )). Recall that the type 
of t in £ is determined by the m-types of all tuples (/ B (a ei ), . . . , / B (a 6m )) in £, for 
1 < ex, . . . , e m < k. Note that 

/ A (M a lV • ■ : a i m )> ■ • './"(an:- - ■-««')) = ^(mKS- ■■><"■)>•■ - ,M«V ■ ■ : a n m )) 

by assumption, and hence (/ B (a ei ), . . . , / B (a em )) has the same type in £ as the tuple 
(h B (a ei ), . . . , ft, B (a em )). We conclude that t has the same type as s in £, and there 
exists an automorphism j3 of £ that maps i to s, satisfying {[sj) as desired. 

The existence of g E Pol(25) and a,b E End(£) so that B |= Mx.g(x\, . . . ,x n ) = 
a(g(xi 11 . . . ,Xi n )) and B |= Vx. b(a(x)) — x follows from Lemma 3.1.8 since those 
two sentences are universal, and the existence of such functions g, a, b holds locally as 
we have seen above. Since a is invertible in End(£), it is an automorphism of £ as 
required. □ 



Theorem 5.6.8. Let 05 be a model- complete core and interdefinable with a struc- 
ture £ that is homogeneous in a finite relational signature, and let B be the polymor- 
phism algebra of 58 . When all operations of B are canonical with respect to £, then 
the following are equivalent. 

(1) The two-element structure ({0, 1}; {(0, 0, 1), (0, 1, 0), (1, 0, 0)}) does not have 
a primitive positive interpretation in 05. 

(2) The pseudo-variety (equivalently, the variety) generated by B does not con- 
tain a two- element algebra A all of whose operations are projections. 

(3) B has Taylor terms modulo automorphisms of 05, that is, there is an n-ary 
operation f in B and for all i < n there are x\, . . . , x n , yi, . . . , y n E {x, y} 
and unary invertible operations a\, . . . ,a n in B such that B satisfies 



Vx. f(x!, . . . ,Xi-i,x,x i+ i, . ..,x n ) = onf{yx, . . . ,yi-x,y,y i+1 , ...,y n ) 
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(4) B has a weak near-unanimity term modulo automorphisms of 03, that is, 
an n-ary operation f, for n > 2, and unary invertible operations a%, . . . , a n 
satisfying 

Vx, y. f(x, ...,x,y) = a 2 f(x, . . . , y, x) = ■ ■ ■ = a n f(y, x, . . . , x) . 

(5) B has a Siggers term modulo automorphisms of 03, that is, a four-ary op- 
eration f and unary invertible operations a±,a2 satisfying 

Vx, y- f(y, y, x, x) = a x f{x, x, x, y) = a 2 f(y, x, y, x) . 

(6) B has a cyclic term modulo automorphisms of 03, that is, an n-ary operation 
f , for n > 2, and a unary invertible operation a such that 

B \=Vxi, . . . ,x n . f(x u . . . ,x n ) =af(x 2 ,...,x n ,xi) . 

Proof. Let m be the maximal arity of £. Note that T(B) is a finite idempotent 
algebra by Lemma 5.6.6 Hence we can apply Theorem |5. 6. 2| Consider first the case 



that T(B) has an n-ary operation g for n > 2 such that T(B) |= Vx. g(x\, . . . , x n ) 
g(x2, ■ ■ ■ , x n , X\). By Lemma 5.6.7| there is an / and an invertible unary a in B such 



that B |= Vx. f(x\, . . . , x n ) = af(x 2 , ■ ■ ■ , x n , X\). The cases for Taylor terms, weak 
near-unanimities, and Siggers terms are similar. Otherwise, by Theorem |5.6.2| there 
exists an algebra A with domain {0, 1} in the pseudo-variety generated by T(B) such 
that every operation in A preserves 1IN3. Since T(B) G H(W n ) we have that A is 



also in the pseudo-variety generated by B, and it follows from Theorem 5.5.15 that 



({0, 1}; 1IN3) has a primitive positive interpretation in B. □ 

For w-categorical model-complete cores, the property to have a polymorphism 
satisfying Taylor term identities modulo automorphisms is stable under expansions 
by finitely many constants, as we see in the following. 

Proposition 5.6.9. Let *B be an u>- categorical model- complete core. If *S has a 
polymorphism that satisfies Taylor term identities modulo automorphisms o/*B, then 
the expansion of 03 by finitely many constants also has a polymorphism that satisfies 
(the same) Taylor term identities modulo automorphisms o/25. 

Proof. Let c = (ci, . . . , c^) be the constants, and £ be the expansion of 58 by 
those constants. Let / be the polymorphism such that for every 1 < i < n there are 
Xi, . . . , x n , yi, . . . ,y n € {x, y} and invertible unary operations a\, . . . , a n in B such 
that 

f(xi,.. .,Xi-i,x,x i+ x, ...,x n ) = aif(yi, . . .,yi-i,y,y i+ i, ...,y n ). 

The operation / given by x <— > f{x, . . . , x) is generated by automorphisms of 03 since 
03 is a model-complete core, and hence there exists an automorphism (3 of 03 with 
f3f(c) = c. The operation g = f3f preserves the constants and is a polymorphism of 
£. Define 7$ = ficuji' 1 , and note that since /(c) = onf{c) we have 

7i (c) = pa^- x {c) = paj{c) = (3f(c) = c . 

This shows that 7* preserves the constants and is an automorphism of €. We claim 
that g satisfies Taylor term identities (the same Taylor term identities as /) modulo 
automorphisms of 03, since for all i < n 

Q\X\) . . . , Xi— 1 , X, Xi-\-i , . . . , X n ) — j3 f (x \ , . . . , Xi—\, X , Xi-\-\, . . . , X n ) 

= P&if{yi, ■ ■ ■ ,yi-i,y,yi+i, ••■,y n ) 
= nPfiyi, ■ ■ - ,yi-i,v,yi+i,-- -,Vn) 
= iig(yi, ■ ■ ■ , y%-x, y, Vi+i, ■ ■ ■ , y n ) ■ 

□ 
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How strong is the assumption that every operation of an oligomorphic clone is 
canonical with respect to some homogeneous structure £ over a finite relational signa- 
ture? In Chapter [8j we see that under fairly general Ramsey-theoretic assumptions on 
£ we can find canonical operations in a natural way. Indeed it turns out that when 03 
is first-order definable in the random graph (V; E) and does not primitively positively 
interpret ({0, 1}; 1IN3), then 03 is the reduct of a structure where all polymorphisms 
are canonical. We make the following general conjecture. 

Conjecture 5.3. Let 03 be a countable lj- categorical model- complete core. Then 
either 03 interprets all finite structures primitively positively with parameters, or 03 has 
a k-ary polymorphism f and an automorphism a such that Vxi, . . . ,x n . f(x%, . . . , Xk) — 
af(x 2 ,.-.,x k ,xi). 

This conjecture has been confirmed for all 03 that have a first-order definition 
over (Q;<), and for all 03 that are definable over the countably infinite universal 
homogeneous graph. 

Let us remark that in order to 'prevent' primitive positive interpretations of all 
finite structures, it suffices to have a Taylor term / modulo automorphisms applied 
to the arguments of / and to the function value of /, in the following sense. 

Proposition 5.6.10. Let 03 be an lj -categorical structure with polymorphism al- 
gebra B, and suppose there is an n-ary f such that for all i < n there are x%, . . . , x n , 
Vx, • • ■ i Un € {x, y} and unary ao, . . . , a„, [3q, . . . , /3„ such that B satisfies 

Vx. a f(aixi, . . . , eti-iXi-i, a t x, a i+ ix t+ i, . . . , a n x n ) 

= /3o/(/3iyi, ■ • ■ , Pi-m-i, fay, Pi+m+i, ■ ■ -iPnyn) ■ (9) 

Then there is no primitive positive interpretation of ({0, 1}; 1IN3) in 03. 

Proof. Suppose that ({0, 1}; UN 3) had a primitive positive interpretation in 
03 . Then by Theorem 5.5.18| there is a 2-element algebra A in the pseudo- variety 
generated by B all of whose operations are projections. The algebra A is in particular 
in the variety generated by B, and satisfies Q. Since the unary operations must be 
projections, the i-th equation prevents that / is the i-th projection. Because we have 
such an equation for each argument of /, the operation / cannot be a projection. □ 



Proposition 5.6.10 can be used in combination with Proposition 5.6.9 to show 
that the two cases in Conjecture |5.3| are indeed disjoint, as we see in the following. 



Corollary 5.6.11. Let 03 be an lj- categorical model- complete core. //03 inter- 
prets all finite structures primitively positively with parameters, then 03 does not have a 
k-ary polymorphism f and an automorphism a such that Vxi, . . . , x n . f(x\, . . . , Xk) = 
af(x 2 ,...,x k ,xi). 

The following question has a positive answer for finite structures T. 

Question 5.1. Let 03 be a countable uj -categorical relational structure such that 
the model- complete core o/03 has a k-ary polymorphism f and an automorphism a sat- 
isfying Vxi, . . . , x n . f(x\, . . . , Xk) = af(x2, ■ ■ ■ , Xk, x\). Is it true that also 03 has a k- 
ary polymorphism f and an automorphism a' such that Vxi, . . . , x n . f (x\, . . . , Xk) = 
ct ' f '{x2, ■ ■ ■ , Xk, Xi) ? Is the same true if we only require that a is an endomorphism 
rather than an automorphism of ' 03 ? 



CHAPTER 6 



Equality Constraint Satisfaction Problems 
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This section is about structures with a first-order definition in (N; =); such struc- 
tures will be called equality constraint languages. From a model-theoretic perspective, 
equality constraint languages appear to be trivial altogether. However, the set of all 
such structures, taken up to primitive positive interdefinability and ordered by inclu- 
sion, is a quite complicated object (there are actually 2" many equality constraint 
languages that pairwise do not define each other primitively positively |28| ). 

By the results of Section [3. 3[ a structure 93 is isomorphic to an equality constraint 
language if and only if *B is preserved by all permutations of its domain. Therefore, 
this chapter is about locally closed clones that contain all permutations of the domain. 



On a finite domain, such clones have been completely described in |104[ ; it turns out 
that the number of clones that contain all permutations of a fixed finite domain 
is finite. Clones on infinite sets that contain all permutations are of independent 
interest in universal algebra 106 145||166"||167 . Local closure is a strong additional 



assumption, which allows a good understanding of the lattice of all locally closed 



clones that contain all permutations 28 



The CSP for a finite equality constraint language is called an equality constraint 
satisfaction problem. Equality CSPs are of fundamental importance in infinite domain 
constraint satisfaction; we mention some reasons. 

• NP-hard equality relations are very good candidates for establishing hard- 
ness results of other infinite domain CSPs. For instance, it follows from the 
results presented in this section that every structure which admits a primi- 
tive positive definition of the relation {(x, y,z) \ (x = y ^ z) V (x ^ y = z)} 
has an NP-hard CSP. 
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When analyzing an w-categorical structure 05 with the universal-algebraic 
approach, the question which equality constraint languages can be defined in 
05 is of crucial importance, as we will see for instance in Chapter [9] and [To] 
For example, if the relation {(x,y,u,v) \ x = y u = v} is primitive 
positive definable in 05, then every polymorphism of 05 that depends on all 



its arguments must be injective (Proposition 6.1.4). 

Suppose we want to classify the computational complexity of CSP(05) when 
05 has a first-order definition in a fixed infinite structure £; examples of such 



classifications will be given in Chapter [9] and 10 Then such a classification 



includes the classification of equality constraint satisfaction problems. 



The complexity of equality CSPs has been completely classified 37 ; those prob- 
lems are in P or NP-hard. In this chapter we present a new proof of this result, and 
show that either the template 05 has a binary injective polymorphism, in which case 
05 has a quantifier-free Horn definition in (N; =), and CSP(05) is in P, or ({0, 1}; NAE) 
has a primitive positive interpretation in 05, and CSP(05) is NP-complete. The new 
proof has the advantage that it divides the argument into several steps, some of which 
generalize to much larger classes of structures. Thus, several results of this chapter 
turn out to be useful in later classification arguments, in particular in Chapter [9] 

We would also like to mention that the fact that satisfiability of quantifier-free 
Horn clauses over (N; =) can be decided in polynomial time has already been observed 



123 . Here, we derive the algorithm from more general principles that will also be 



important for our algorithmic results in Chapter M and Chapter [TO] 
6.1. Independence of Disequality 

The importance of independence in constraint satisfaction has been recognized 
several times; the first appearance of this concept in the literature seems to be 
141 , and, subsequently, in 1142] . In this thesis, we focus on independence of 



in 



disequality, which found most applications; the general definition of independence has 



been worked out in 71 . Applications of this concept have been studied in metric 
temporal reasoning |120 131] and qualitative reasoning cacluli 54 55 ; also see 34 



Definition 6.1.1 (Independence of Disequality). Let 05 be a structure with re- 
lational signature r. Then we say that ^ is independent from 05 if for all primitive 
positive r-formula <f>, if both <f> A x ^ y and (f> A u ^ v are satisfiable over 05, then 
(f>Ax^=yAu=/=v is satisfiable over 05 as well. 

In this section we prove that for arbitrary w-categorical structures, independence 
of disequality is equivalent to the existence of a binary injective polymorphism. The 
following definition comes from |37| . 

Definition 6.1.2. A relation R C B k is called intersection-closed if for all It- 
tuples (u\, . . . ,Uk), (vi, ■ . . ,Wfc) € R there is a tuple (w\, . . . ,Wk) € R such that for all 
1 < i, j < k we have it), ^ Wj whenever Ui ^ Uj or Vi ^= Vj. 

Lemma 6.1.3. Let 05 be an uj- categorical structure where ^ is primitive positive 
definable. Then the following are equivalent. 

(1) Disequality is independent from 05. 

(2) Every finite induced substructure of 05 2 admits an injective homomorphism 
into 05. 

(3) 05 has a binary injective polymorphism. 

(4) All primitive positive definable relations in 05 are intersection-closed. 

Proof. Throughout the proof, let &i,&2>- • ■ be an enumeration of the domain 
B of 05. If / is a binary injective polymorphism of 05, then clearly every relation 
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in 25 is intersection-closed, so (3) implies (4). The implication from (4) to (1) is 
straightforward as well. 

We now show the implication from (1) to (2). Let 21 be a finite induced sub- 
structure of 25 2 . Then the domain of 21 is of the form {bi, . . . , b n } 2 , for sufficiently 
large n. It clearly suffices to show that the structure induced by {b\, . . . , b n } 2 in 23 2 
homomorphically maps to 25, so let us assume without loss of generality that the 
domain of 21 is . . . , b n } 2 . 

Consider the formula (f> whose variables x\, . . . , x n i are the elements of 21, 

xi := ...,x n := (&i,6„), . . . ,x n 2_ n+1 := (b n ,bi), ...,x n a := (6 n ,6 n ) , 

and which is the conjunction over all literals R{{bi 11 bj 1 ), . . . ,(bi k ,bj k )) such that 
R(bi 1 , . . . ,bi k ) and R{bj 1 , . . . ,bj k ) hold in 25. So (f> states precisely which relations 
hold in 21. 

Using induction over the number of disequalities, we will now show that for any 
conjunction a := f\ 1<k<m Xi k 7^ Xj k with the property that 7^ jk for all 1 < k < m, 
the formula cj> A a is satisfiable over 23. This implies that there exists an n 2 -tuple t in 
25 with pairwise distinct entries which satisfies <j>; the assignment that sends every Xi 
to ti is an injective homomorphism from 21 into 25. 

For the induction beginning, let Xi =^ Xj be any disequality. Let r, s be the 
n 2 -tuples defined as follows. 

r := (bi, . . . ,bx,b 2 , . . . ,b 2 , . . . ,b n , . . . ,b n ) 

s := (61,62, ...,&„,&!, 6 2 , 6 n , ... , 61, 6 2 , • • • , 6„). 

These two tuples satisfy <j>, because the projections to the first and second coordinate, 
respectively, are homomorphisms from 21 to 23. Now either r or s satisfies Xi ^ Xj, 
proving that <p A Xi ^ Xj is satisfiable in 25. 

In the induction step, let a conjunction a := Ai<fc< m x ik 7^ x jk ^ e given, where 
m > 2. Set a' := f\s<k<m Xi k ^ x jk' an d <f>' '■— 4> A o' . Observe that cj)' has a 
primitive positive definition in 25, as <j> an d ^ have such definitions. By induction 
hypothesis, both </>' A Xi 1 7^ Xj 1 and 4>' A xi 2 7^ satisfiable in 25. But then 

4>' A Xi 1 7^ Xj 1 A Xi 2 7^ Xj 2 — 4> A a is satisfiable over 25 as well by (1), concluding the 
proof. 



The implication from (2) to (3) follows from Lemma 3.1.5 because the property 
that a function is injective can be described by the universal first-order sentence 
Vx,y(x?y=>f(x)?f(y)). □ 

Also the situation that the polymorphisms / of an w-categorical structure are 
'essentially injective' can be characterized using equality relations. 

Proposition 6.1.4. Let f be an operation from B k to B that depends on all 
arguments. Then the following is equivalent. 

(1) f is injective. 

(2) / preserves the relation defined by x = y u = v. 

(3) / preserves the relation defined by x = y => u = v. 

Proof. For the implication from (1) to (2), suppose that / is injective. We 
check that / preserves x = y ^ U = v. Let a,b,c,d be elements of B k such that 
a, = bi 4^ Cj = di for all i < k, and let t be the tuple (f(a),f(b),f(c),f(d)). If 
a = b, we thus have that Cj = di for all i < k, and so c = d. In this case, t satisfies 
t\ = t 2 and t 3 = ti, and we are done. Similarly, if c = d then a = b and we are done. 
Otherwise, a 7^ b and c 7^ d, and by injectivity of / we have t\ 7^ t 2 and £3 7^ £4. So 
we have in all cases that t\ = t 2 if and only if £3 = £4. 
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For the implication from (2) to (3), note that x = y =>■ u = v is equivalent to a 
primitive positive formula over (B; R) where R = {(a, 5, c, d) \ a = b c = d}). The 
primitive positive formula is 

3w y, u, w) A R(x, y, w, v) . 

Finally, for the implication from (3) to (1), suppose that there are distinct a,b € 
B k such that f(a) = f(b). We want to prove that / violates x = y =>■ u — v. Let 
I be the set of all i G {1, . . . , k} such that aj 7^ bi. Since a and b are distinct, / is 
non-empty; let i € I be arbitrary. Since / depends on the z-th argument, there are 
c, d G D with Cj = dj for all j ^ i, and bi 7^ c$. We claim that (a,b,c,d) shows 
that / violates x = y =>■ u = w. First, note that for all j € {1, . . . , k} \ I, we have 
that a,j = bj and Cj = dj. Next, note that for all j 6 / we have that dj 7^ 6j, so 
we indeed have that (a,b,c,d) satisfies x = y => u = v. However, /(a) = f(b) and 

M¥>M- □ 

We close with an application to CSPs. When 21 is an instance of CSP(Q5), then 
an injective homomorphism from 21 to 58 is also called an injective solution for 21. 

Proposition 6.1.5. Suppose that 58 has a binary injective polymorphism h. Then 
every satisfiable instance 21 o/CSP(Q5) either has an injective solution, or 21 has two 
distinct elements a, a' such that s(a) = s(a') in all solutions s for 21. 

Proof. Suppose that 21 has a solution, but no injective solution. Let / be a 
solution such that the cardinality of / is maximal. Since there is no injective solution, 
there are two elements a, a' of 21 such that f(a) = /(a'). We claim that s(a) = s(a') 
in all solutions s of 21. Otherwise, if s(a) 7^ s(a') for some solution s, then by the 
choice of / there must be another pair b,b' such that s(b) 7^ s(b') but fib) 7^ /(&'). 
Then the mapping x H> h(f(x), s(xj) is also a solution to 21, but has a strictly larger 
image than /, a contradiction. □ 

6.2. Existential Horn Formulas 

In this section we show that when 05 has certain binary injective polymorphisms 
if and only if all relations in 58 have a quantifier-free Horn definition over a 'base' 
structure £; this will often be useful to design algorithms for CSP(58). 



The structure (N; —) has quantifier-elimination: this follows from Lemma 3.6.1 
by the observation that every bijection between finite subsets of N can be extended to 
a permutation of N. So we could have defined equality constraint languages as those 
relational structures where all relations have a quantifier-free definition in (N;=). 
Note that not all equality constraint languages have quantifier-elimination; however, 
all equality constraint languages are model-complete. 

Proposition 6.2.1. Every equality constraint language 58 is model- complete. 

Proof. The permutations of N locally generate all injective self-maps on N. 
Hence, the statement follows from Theorem |3 . 6 . 7| by the observation that the embed- 
dings of 58 are locally generated by automorphisms of 58. □ 

The following is a simple, but very useful definition to prove syntactic results. 

Definition 6.2.2 (as in |28| ). A quantifier- free first- order formula <f> in conjunc- 
tive normal form is called reduced (over a structure 58 j if every formula obtained 
from <j) by removing a literal or a clause is not equivalent to (j) (over 58J. 

Clearly, every quantifier- free formula is equivalent to a reduced formula over 58, 
because we can find one by successively removing literals and clauses from <f>. The 
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following theorem is from 27 and [33] (stated there for quantifier-free Horn formulas 
only). 

Theorem 6.2.3. Let 03 be a structure with an embedding e from 03 2 into 03. 
Then a relation R with an existential definition (quantifier-free definition) in 03 has 
an existential Horn definition (or quantifier-free Horn definition, respectively) in 03 
if and only if R is preserved by e. 

Proof. (Backwards.) Let S be an existential Horn definition of R over 03, written 
in prenex conjunctive normal form. It suffices to demonstrate that e preserves each 
clause in 6. Note that a Horn clause ip of 6 can always be written in the form 
(fa A • • • A fa) — > fa, for atomic r-formulas fa, . . . ,fa. Let V be the variables of ip, 
and let S\,S2- V — > N be two assignments that satisfy the clause. We claim that 
S3 : V — > N defined by Ss(x, y) = e(sx(x), S2(y)) satisfies ip. There are two cases cases 
to consider. Either there is an i < I such that s\ or S2 does not satisfy fa. In this 
case, since e is an embedding from OS 2 to 03, S3 does not satisfy fa, and therefore 
satisfies ip. Or, if for all i < I both si and S2 satisfy fa, then they also satisfy fa. 
Since e is a polymorphism of 03, it follows that S3 satisfies fa, and therefore also ip. 

(Forwards.) Consider an existential definition 5 of R in 03 such that 6 is in prenex 
normal form, and that the quantifier-free part rj of 8 is a reduced CNF formula over 
03. Assume for contradiction that <5 is not existential Horn, that is, rj has a clause 
ip = fa V fa V fa V • ■ • V fa where fa , fa are positive literals, and fa, . . . ,fa are positive 
or negative literals. Let V be the variables of rj. Since rj is reduced, it has a satisfying 
assignment si : V — > N such that fa is false for all i < I except for i — 1; otherwise, 
we could remove fa from ip and would obtain a formula that is equivalent to rj over 
03, contradicting the assumption that 77 is reduced. Similarly, 77 has a satisfying 
assignment S2 : V — > N such that fa is false for all i < Z except for i = 2. Then 
S3 : V — > N defined by Ss(x, y) — e(si(x),S2{y)) does not satisfy ip, a contradiction. 

It is clear that the same proof works in the special case that the relation R has 
a quantifier-free definition over 03: the proof in fact shows that every quantifier- free 
reduced formula that is preserved by e must be quantifier-free Horn. □ 

The structure 03 := (N; =) is an obvious example with an embedding from 03 2 
into 03. When 03 is a relational structure, then 03^ denotes the expansion of 03 by all 



relations that are the complement of a relation from 03. The following is from 27 
(note that we de not assume cj-categoricity of 03 and £) . 

Theorem 6.2.4. Let £ be a structure with an embedding e from £ 2 into £. Let 
03 be a relational structure with finite signature a that is preserved by e and has an 
existential definition in €. Then there is a polynomial-time Turing reduction from 

csp(03) to csp(er). 



Proof. We use the algorithm shown in Figure 6.1 which is due to [71]. By 



Theorem 6.2.3 every relation of 03 has an existential Horn definition in £. Let cj> be 
an input instance of CSP(03), and let ip be the sentence in the language of £ obtained 
from <p as described in the algorithm. Since a is finite and fixed, and does not depend 
on the input, there is only a linear number of literals that can be deleted from ip in 
the course of the algorithm. It is thus clear that the algorithm works in polynomial 
time. 

To show that the algorithm is correct, observe that <j> is false in 03 if and only if 
ip is false in £. We first show that if the algorithm rejects, then ip is false in 03. The 
reason is that whenever a negative literal rj is removed from a clause of ip, then in 
fact -177 is implied by the other clauses in ip, and therefore removing rj from ip leads 
to an equivalent formula. 



124 



6. EQUALITY CONSTRAINT SATISFACTION PROBLEMS 



// Input: An instance <p of CSP(QS) 

/ / Assumption: 03 has an existential Horn definition in a r-structure £. 
Replace each constraint R(xi, . . . , x n ) from <j> by 5{x\, . . . , x n ), 

where S is an existential Horn definition of R in £. 
Let ip be the resulting r-sentence, written in prenex conjunctive normal form. 
Repeat := true 
While Repeat = true do 

Repeat := false 

Let ^ be the set of all singleton clauses in ip. 
If is unsatisfiable over £ then reject. 
For each negative literal n of ip do 

If <P U n, considered as an instance of CSP(C^), is unsatisfiable 
Remove n from its clause in ip 
Repeat := true 
End for 
Loop 
Accept 



Figure 6.1. A polynomial-time Turing reduction from CSP(03) to 
CSP(CP) when 03 has a polymorphism that is an embedding of £ 2 
into £. 



Finally, we show that if the algorithm accepts, then ip is true in €. Let B be the 
domain of 03 and £, and let V be the set of variables of ip. Consider the negative 
literals r)i, . . . ,r) m that are in clauses of ip at the final stage of the algorithm. For all 
i < m, let ti : V — » B be an assignment that satisfies all clauses of ip without negative 
literals, and which also satisfies r\i. Such an assignment must exist, since otherwise r\i 
would have been false in all solutions, and our algorithm would have removed rji in 
the inner loop of the algorithm. We claim that s : V — > B given by 

s(x) = e{ti(x),e(t 2 (x), . . . e(t m -i(x),t m (x)) ...)) 

satisfies all clauses of ip. Negative literals r/k are satisfied because satisfies r/k, and 
e is an embedding of £ 2 into £. Positive literals from ip are satisfied by s because 
they are satisfied by all the t iy and since e is a polymorphism of <£. □ 



6.3. Classification 

We now finish the complexity classification for CSP(03) where 03 is an equality 
constraint language. The first and central step is the proof that if 03 contains ^ and 
has an essential polymorphism, then 03 also has a binary injective polymorphism. 
Here we use Corollary |5.3.11 about existence of binary essential polymorphisms, and 



Lemma 6.1. 3| about the existence of binary injective polymorphisms. The last part 



of the argument is specific to equality constraint languages. However, the technique 
appears to be quite flexible, and will be adapted to constraint languages that are 
first-order definable over the random graph in Section |9.3.2| 

Theorem 6.3.1. Let 03 be an equality constraint language with an essential poly- 
morphism, and suppose ^ is primitive positive definable in 03. Then 03 has a binary 
injective polymorphism. 



PROOF. Since equality constraint languages are 2-transitive, Corollary |5.3.11 



plies that 03 is preserved by a binary essential operation /. We apply Lemma 6.1.3| 



and have to show that for every primitive positive formula <p the formula <j> A x ^ 
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y A u 7^ v is satisfiable over 03 whenever <fi A x ^ y and <f> Au ^= v are satisfiable over 
23. 

Since / is essential, it violates the relation Pf (Lemma 5.3.2). Hence, there are 
tuples r,s £ P% such that f(r,s) £ P% . Since / preserves 7^, we can assume that 
r is of the form (a, a, b) for a =/= b, and s is of the form (c, c£, d) for c =/= d. Since 
f(r, s) £ Pf, we have that f(a, c) ^ f(a, d) and f(a, d) ^ f(b, d). 

Let V be the variables of <j>, and let s : V — > N be a satisfying assignment for 
4> A x ^ y, and V -> N be a satisfying assignment for cj> A it 7^ v. We can assume 
that s(u) = s(v) and £(a?) = t(y), otherwise we are done. Let k be the cardinality of 
the set {s(x), s(y), s(u),t(u),t(v),t(x)}\ note that 4 < k < 6. Suppose that fc = 6, 
the other cases are simpler. 

Let a',6',c',d' € N be arbitrary such that a,b,a',b' are pairwise distinct and 
a,c,a',c' are pairwise distinct. Then, f(a',c') 7^ f(b',d') since / preserves 7^. Note 
that this implies that f(a',c') — f(a',d') and f(a',d') = f(b',d') cannot both be 
true. If f(a',c') ^ f(a',d'), then choose permutations a,/3 of N such that as(u) = 
a',as{x) = a,as(y) = b and pt{u) = c\j3t{v) = d',/3t(x) = d. If f(a',d') ^ f(V,df), 
then choose permutations a, j3 of B such that as(u) — a,as(x) = a',as(y) = b' and 
f3t(u) = c,/3t(v) = d, (3t(x) = d' . It is straightforward to verify that in both cases the 
assignment 

z ^ f(as(z),f3t(z)) 

is a satisfying assignment for (j> A x 7^ y A u 7^ v. □ 

Corollary 6.3.2. Let 03 be an equality constraint language. Then one of the 
following cases applies. 

(1) 03 has a constant polymorphism. 

(2) 03 has a binary infective polymorphism. 

(3) In 03 every first-order formula is equivalent to a primitive positive formula. 

Proof. Suppose that 03 does not have a constant polymorphism. Since equality 
constraint languages have 2-transitive automorphism groups, we can use the contra- 



positive of Corollary 5.3.8 to derive that all polymorphisms of 03 must preserve 7^. 
The endomorphisms of 03 are therefore injective, and locally generated by the auto- 
morphisms of 03. If 03 does not have essential polymorphisms, then Corollary |5.3.4| 
shows that all relations that are first-order definable in 03 are also primitive positive 
definable in 03, and we are in case (3). If 03 has an essential polymorphism, then 03 



has a binary injective polymorphism by Theorem 6.3.1 □ 



We can now give the complexity classification for equality constraint languages. 

Theorem 6.3.3. Let 03 be an equality constraint language. Then exactly one of 
the following cases applies. 

• 03 has a polymorphism f and an automorphism a such that 

f(x,y) = af(y,x) 

for all elements x and y of 03 . In this case, for every finite reduct 03' of 03 
the problem CSP(03') can be solved in polynomial time. 

• There is a primitive positive interpretation of ({0, 1}; HNS) in 03. In this 
case, there is a finite reduct 03' o/03 such that CSP(03') is NP-complete. 

Proof. By Proposition |5.6.10l the two cases are disjoint. If 03 has a constant 
polymorphism, then clearly there are / and a such that f{x,y) — af(y,x) for all 
x,y € B. The claim for finite reducts of 03 follows from Proposition |1 . 1 . lTj 

Now suppose that 03 has a binary injective polymorphism /. Such an operation 
is an embedding of (N; —) 2 into (N; =), and we can find a permutation a of B such 
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that f(x,y) = af(y,x) for all x,y € N. Since every relation of 03 has a quantifier- 
free definition over (N;=), Theorem 6.2.3 shows that every relation of 03 even has 
a quantifier-free Horn definition over 03. By Theorem |6.2.4| the CSP for every fi- 
nite signature reduct of 03 can be reduced to GSP((N; =, ^)) in polynomial time. 
Tractability of CSP((N; =, ^)) has been shown in Section[T7l] By Corollary [6A2| the 
only remaining case is that over 03 all first-order formulas are equivalent to primitive 
positive formulas. In this case the claim follows from Corollary |5.5.11| □ 



CHAPTER 7 



Topology 




Several important properties of cj-categorical structures only depend on their 
automorphism group considered as a topological group, that is, on their automorphism 
group viewed as an abstract group, with the topology of pointwise convergence on 
the group elements. This is in particular the case for certain Ramsey properties that 
become important in the next chapter. We therefore give a self-contained introduction 
to basic topological background, with a focus on the topics that become relevant for 
our applications to automorphism groups of w-categorical structures. 

7.1. Topological Spaces 

A topological space is a set S together with a collection of subsets of S, called the 
open sets of S, such that 

(1) the empty set and S are open; 

(2) arbitrary unions of open sets are open; 

(3) the intersection of two open sets is open. 

Complements of open sets are called closed. For ECS, the closure of E is the set of 
all points x such that every open set in S that contains x also contains a point from 
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E. Clearly, the closure of E is a closed set. A subset of E of S is called dense (in S) 
if its closure is the full space S. The subspace of S induced on E is the topological 
space on E where the open sets are exactly the intersections of E with the open sets 
of S. 

Definition 7.1.1. A mapping between two topological spaces is called continuous 
if the pre-images of open sets are open, and open if images of open sets are open. A 
bijective open and continuous map is called a homeomorphism. 

A basis of S is a collection of open subsets of S such that every open set in S 
is the union of sets from the collection. For s £ S, a collection of open subsets of 
S is called a basis at s if each set from the collection contains s, and every open set 
containing s also contains an open set from the collection. For a sequence (s„) rl >i 
of elements of S, we write s n — > s, and say that s n converges (against the limit s) if 
for every open set U of S that contains s there exists an no such that s n £ U for all 
n > no- 

A topological space S is called 

• discrete if every subset of S is open (and hence also closed) ; 

• compact if for an arbitrary collection {Ui}i^A of open subsets of S with 
S = {J ieA Ui there is a finite subset B of A such that S = {J ieB Un 

• Hausdorff if for any two distinct points u, v of S there are disjoint open sets 
U and V that contain u and v, respectively; 

• first- countable if for all s £ S there exists a countable basis at s. 

• separable if there is a countable dense set; 

• metrizable if S has a compatible metric, i.e., a metric d on S so that the 
open sets are unions of sets of the form {y £ S | d(c,y) < r}, for x £ S, 
< r £ K; 

• completely metrizable if it has a compatible complete metric d, i.e., a metric 
d on S where every Cauchy sequenc^ converges against an element of S; 

• Polish if S is separable and completely metrizable. 

The following equivalent characterization of continuity of maps on a first-countable 
space S is often easier to work with. For x £ S, we say that / is continuous at x if 
for every open V C S containing f(x) there is an open U C S containing x whose 
image fill) is contained in V . 

Proposition 7.1.2. Let S be a first- countable and T an arbitrary topological 
space. Then for every f : S — > T the following are equivalent. 

(1) f is continuous. 

(2) For all s n , if s n s then f(s n ) — > f(s). 

(3) / is continuous at every x £ S . 

Proof. The implication from (1) to (2) is true even without the assumption that 
S is first-countable. Let (s n ) n >i be such that s n — ¥ n, and let V be open so that 
f(s) £ V. Then U :— /~ 1 (1 / ) is open, and s £ U. So there exists an n with s n £ U. 
For this n, f(s n ) £ V. So f(s n ) -> f{s). 

For the implication from (2) to (3), we show the contraposition. Suppose that / is 
not continuous at some s £ S. That is, there exists an open set V containing f(s) such 
that all open sets U that contain x have an image that is not contained in V. Since S is 
first-countable, there exists a countable collection U n of open sets containing x so that 
any open V that contains x also contains some U n . Replacing U n by (~]^: =1 Uk where 
necessary, we may assume that Ui D U% D ■ ■ ■ ■ If U n C / _1 (V^), then f(U n ) C V, in 

*A sequence (s n ) of elements of a metric space (S; d) is a Cauchy sequence if for every e there 
is an no > such that for all n, m > no we have that d(s n , s m ) < e. 
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contradiction to our assumption; so we can pick an x n G U n \ for all n, and 

obtain a sequence that converges to x. But s n £ f l {V) for all n, and so f{s n ) does 
not converge to f(s) G V. 

Finally, the implication from (3) to (1) again holds in arbitrary topological spaces. 
Let l^CTbe open. We want to show that U := f~ l {V) is empty. When s is a point 
from U, then because / is continuous at s, and V contains f(s) and is open, there 
is an open set U s C S containing s whose image f(U s ) is contained in V. Then 
lj sg(7 U s — U is open as a union of open sets. □ 

The product Yl ieI Si of a family of topological spaces (Si)i e j is the topological 
space on the cartesian product XjgjSj where the open sets are unions of sets of the 
form Xi^iUi where Ui is open in Si for all i 6 J, and Ui — Si for all but finitely 
many i G I. When I has just two elements, say 1 and 2, we also write Si x £2 for the 
product (this operation is clearly associative and commutative). We denote by S k for 
the fc-th power S X • • • x S of S, equipped with the product topology as described 
above. 

We also write S 1 to a |/|-th power of S, where the factors are indexed by the 
elements of /. In this case, we can view each element of T := S 1 as a function from 
I to S in the obvious way. The product topology on T is also called the topology of 
pointwise convergence, due to the following. 

Proposition 7.1.3. Let S be a topological space, and I be a set. Then — > £ 
in the product topology on T = S 1 if and only if £ n (j) ~^ £(i) in S for all j G I. 

PROOF. Suppose first that £ n — >• £ in T. Let j G I be arbitrary and let V be an 
open set that contains £(j). Then the set U :— Ili<i<j S x V x Y\j <i<K S is open 
in T and contains £, so there is an uq such that G U for all n > n$. But then 
£n(j) <= v for a11 n t n o, and so £„(j) -> £(j). 

Now suppose that £„(j) — > £(j) in S for all j G 7, and let V be an open set of T 
that contains f. Without loss of generality V is of the form V\X ■ ■ ■ xT4x S x S X ■ ■ ■ , 
for open subsets Vi, . . . , Vk of S. For each j < k there exists an rij so that G V; 
for all i > rij. Then £j G V for all /c > max(n\, . . . , rife), and hence □ 

Since a topological space is Hausdorff if and only if limits of converging sequences 
are unique, it follows in particular that products of Hausdorff spaces are Hausdorff. 
The following is more substantial. 

Theorem 7.1.4 (Tychonoff; see e.g. 118| ). Products of compact spaces are com- 
pact. 

Example 7.1.5. When we equip the natural numbers N with the discrete topol- 
ogy, then N N with the topology of pointwise convergence is called the Baire space. 
The open sets are exactly the unions of sets of the form {g G N — > N | g(a) = b} for 
some a,b G N k , k G N. □ 

7.2. Topological Groups 

A topological group is an (abstract) group G together with a topology on the 
elements G of G such that (x, y) xy~ x is continuous from G 2 to G. In other words, 
we require that the binary group operation and the inverse function are continuous. 
Two topological groups are said to be isomorphic if the groups are isomorphic, and 
the isomorphism is a homeomorphism between the respective topologies. 

Example 7.2.1. The eleme nts of the group Sym(N) form a (non-closed) subset 
of the Baire space N N (Example 7.1.5), and the topology induced by the Baire space 



on Sym(N) is also called the topology of pointwise convergence. More generally, the 
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topology of pointwise convergence on Sym(X) is the topology where the open sets 
are the unions of sets of the form {g £ Sym(A) | 17(a) = b} for some finite tuples a, b 
over X. □ 

An action of a topological group G on a topological space S is continuous if it is 
continuous as a function from G x S into S. An important example of a continuous 
action of topological groups is the following. A left coset of a subgroup V of G is a set 
of the form {hg \ g £ V} for h £ G, also written hV. Clearly, the set of all left cosets 
of G partitions G, and is denoted by G/V. The cardinality of G/V is the index of V 
in G. We view G/V as a topological space where a set of left-cosets is open if their 
union is open in G. Analogously we define the space G\V of all right-cosets Vh. We 
can define a continuous action of G on G/V by setting g ■ hV — ghV. This action is 
also called the natural action of G on G/V. 

Every open subgroup H of G is closed, since the complement of H in G is the 
open set given by the union of open sets gH for g £ G \ H . 

A topological group G is 

• Hausdorff (metrizable, Polish) if the topology of G is Hausdorff (metrizable, 
Polish, respectively); 

• first- countable if it has a countable basis at the identity. 

• non-archimedian if it has a basis at the identity consisting of open subgroups. 
It is easy to see that every metrizable topological group is Hausdorff and first- 
countable. We also recall the following. 

PROPOSITION 7.2.2 (Proposition 13 and Proposition 14 in 53"). Let G be a 
topological group, and let H be a subgroup of G. Then 

• G/H is discrete if and only if H is open in G; 

• G/H is Hausdorff if and only ifH is closed in G. 

The topological automorphism group of a structure 03 with domain B is a topolog- 



ical group obtained from the abstract automorphism group G of 03 (see Section 3.3.41 



by equiping the elements G of G with the topology of pointwise convergence, that is, 



the topology induced on G by the one on Sym(_B) as given in Example 7.2.1 



Proposition 7.2.3. A set £3 of permutations of a set B is a closed subset of 



Sym(_B) if and only if it is locally closed as defined in Definition 3.3.1 



Proof. The set of operations £3 is not closed in the topology of pointwise con- 
vergence if and only if there exists a permutation g £ Sym(_B) \ S$ such that every 
open set containing g also contains an element of 33. This is the case if and only that 
for every tuple a, £3 contains an operation h such that the restriction of g to the ele- 
ments of a equals the restriction of h to those elements. According to Definition [33TJ 
this is exactly the case when g is in the local closure by □ 

In the following, let G be a topological group that is the automorphism group 
of a relational structure 03, and let G be its domain (equipped with the topology of 
pointwise convergence). Note that if G is compact then all orbits of G must be finite. 
Hence, when 03 is w-categorical, G cannot be compact. 

It is clear that G is non-archimedian. The topology on G has the following 
compatible metric d. When &i,&2,... is an enumeration of the domain B of 03, 
then for elements /, g £ G we define d(f,g) — l/2 ,l+1 where n is the least natural 
number such that /(&„) ^ g{b n ). This metric is not complete: to see this, let / be 
an arbitrary injective non-surjective mapping from B — > B. For each n, there exists 
a permutation h n of B such that h n (bi) = f(bi) for all i < n. Hence, the sequence 
(^n)ri>i is Cauchy, but it does not converge to a permutation. The topology on G 
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also has the following compatible complete metric d. For elements /,j£Gwe define 
d(f,g) = l/2" +1 where n is the least natural number such that f(b n ) ^ g(b n ) or 
f (b n ) 7^ In fact, the metric d that we have just defined is an ultra-metric, 

that is, it satisfies d{x, z) < max(d(x,y), d(y, z)) for all x,y,z. Moreover, d is left- 
invariant, i.e., d(gx,gy) = d(x,y) for all g,x,y G G. Finally, G is separable: for all 
finite tuples a, b that lie in the same orbit we fix an element of G that maps a to b; 
the (countable) set of all the selected elements of G is clearly dense in G. 

In this thesis, we will be exclusively interested in topological groups that arise as 
automorphism groups of structures; those groups can be characterized in topological 



terms, as demonstrated in Proposition 7.2.4 below 



PROPOSITION 7.2.4 (see Theorem 2.4.1 and Theorem 2.4.4 in [94]). Let G be a 
topological group. Then the following are equivalent. 

(1) G is isomorphic to the topological automorphism group of a countable rela- 
tional structure. 

(2) G is isomorphic to a closed subgroup o/Sym(N). 

(3) G is Polish and admits a compatible left-invariant ultrametric. 

(4) G is Polish and non-archimedian. 

(5) G is Polish and has a countable basis closed under left multiplication, that 
is, a countable basis B of G so that for any U G B and g G G, gll G B. 



PROOF. The equivalence of (1) and (2) has been shown in Proposition 3.3.2 
The implication from (1) to (3) has been explained in the paragraph preceding the 
statement of the proposition. So it suffices to show (3) => (4) =>■ (5) (2). 

For the implication from (3) to (4), let d be a left-invariant ultrametric on G. 
Let U n = {x G G | d(x, 1) < 2~™}, for n € N. We claim that the set of all those U n 
forms a basis at the identity consisting of open subgroups. Since d is a left-invariant 
ultra-metric, for x, y G G we have 

d{x~ 1 y, 1) = d(y, x) < max(d(y, l),d(l, x)) 

and thus U n is a indeed a subgroup. 

For the implication from (4) to (5), assume (4). Let {U\, U2, ■ ■ ■ } be a countable 
basis at the identity (which exists since G is metrizable). Each Ui has an open 
subset Vi which is a subgroup, since G has a basis at the identity consisting of open 
subgroups. Then {V±, V2, . ■ ■ } is a countable basis of the identity consisting of open 
subgroups. The set of all cosets of those groups gives a countable basis that is closed 
under left multiplication. 

Finally, we show that (5) implies (2). Let B = {Ui, Ui, . . . } be a countable basis 
closed under left multiplication. We define the map £ : G — > Sym(N) by setting 

£,(g)( n ) = m & g u n = u m ■ 

This mapping is injective: when / and g are distinct, then there are disjoint open 
subsets U and V with f E U and g G V; since B is a basis, we can assume that 
U = U ni and V = U n2 , for some ni,ri2 > 1- If fU ni — gU ni , then g G U ni = U 
since / G U ni , contradicting the assumption that U and V are disjoint. Hence, 
£(/)(m) ^ £Cff)(ni), and so £(/) ± fo). 

It is straightforward to verify that £,(fg) = £,(f)S,(g), and since bijective algebra 
homomorphisms are isomorphisms, £ is an isomorphism between G and a subgroup 
of Sym(N). It can also be verified that £ is a homeomorphism between G and a closed 



subgroup of Sym(N); for the details of this last step, we refer to 94 . □ 



132 



7. TOPOLOGY 



A subgroup N of G with domain N is called normal if gN = Ng for all elements 
g of G. Recall the following equivalent characterizations of normality of subgroups, 
which can be seen as a refinement of Proposition |5. 5.2] for the case of groups. 

PROPOSITION 7.2.5. Let G be a group, and N be a subgroup of G. Then the 
following are equivalent. 

(1) N is normal. 

(2) G has the congruence E — {(a, b) | ab" 1 G N}. 

(3) There is a homomorphism h from G to some group such that N — ft, _1 (0). 

(4) For every g G G and every v € N we have gvg^ 1 G N. 

Proof. (1) =>■ (2): to verify that E is a congruence, we have to show that for 
all (a 1 ,bx),(a 2 ,b 2 ) G E, (ai<z 2 ,&i& 2 ) € Indeed, (a 1 a 2 )(6i6 2 ) _1 = ai(a 2 &2 ^i" 1 e 
aiiV^ 1 = Naxb^ 1 C AA = A. 



(2) =>■ (3): follows from Proposition 5.5.2 The left coset space G/N is a group 



with the product (aN)(bN) = (ab)N, and g i— > <?A is a group homomorphism from 
G to G/N. 

(3) =>. (4): For g G G and u G ^~ 1 (0), we must show that gvg~ x G ft _1 (0). 
Indeed, h{gvg~ l ) = h{g)h{v)h(g)- 1 = h{g)Qh(g)- 1 = 0. 

(4) (1): assume that gNg~ x C AT for all g G G. Let a G G be arbitrary. 
Applying the assumption for g — a we find that aN C iVa. Applying the assumption 
for ,g = a" 1 we find that a" 1 iV(a" 1 )" 1 = a^Na C TV, and hence TVa C aTV. We 
conclude that aN — Na. □ 

When G is a closed subgroup of Sym(N), then closed normal subgroups of G 
typically arise as the subgroups consisting of those elements of G that fix the blocks 
of a congruence relation on the elements of G. This can be made precise as follow^ 

Proposition 7.2.6. Let G be the automorphism group of a relational structure 
*B with domain B. If E is a G-invariant equivalence relation on B n , for some n, then 
the subgroup of G that preserves each equivalence class of E is closed and normal. 
Conversely, every closed normal subgroup of G is the intersection of closed normal 
subgroups that arise in this way. 

Proof. Let £ be the expansion of *B by a unary relation for each equivalence 
class of E. Then Aut(£) is closed by Proposition |3.3.2 and it is a normal subgroup 



of Aut(*B): when g G Aut(Q5) and h G Aut((£), then g o h o g^ 1 preserves each 
equivalence class of E, and thus is an automorphism of £. Normality of Aut(£) 
follows from Proposition 1 7 . 2 . 5] 

For the second part, suppose that G has a closed normal subgroup N. Consider 
the relation 

Rn : = {{x, y) | x, y G B n and there is h G N such that h(x) = y} . 

This relation is obviously an equivalence relation, and it is preserved by all the au- 
tomorphisms of 58. For this, we have to show that for all g G G and all (x,y) G R n 
we have that (g(x),g(y)) G R n . So suppose that x, y G B n such that h{x) — y for 
some h G N. Then g(y) = g(h(x)) G (gN)(x) = (Ng)(x) = N(g(x)) by normality 
of N, and hence there exists an/i'eiV such that h'(g(x)) — g(y), which shows that 
(g(x),g{y)) G R n . 

Let £ be the structure that contains for all n the n-ary relations given by the 
equivalence classes of the relations R n for all n > 0. We claim that N is precisely 
the automorphism group of £. As in the first part we can verify that every h G N is 



2 I thank Todor Tsankov for pointing this out to me. 
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an automorphism of £. The converse follows by local closure as follows. Let g be an 
automorphism of C, and let x, y be from B n so that g{x) = y. Since g preserves the 
equivalence classes of i?„, there exists an h G N such that h{x) = y. Hence, g lies in 
the closure of N, which implies that g is from N since N is closed. □ 

Example 7.2.7. The automorphism group G of the structure 03 = (Q;Betw), 
where Betw = {(x,y,z) \ (x < y < z) V (z < y < x)}, is 2-transitive and therefore 
primitive. But {({x 1 ,x 2 ), {yi,y 2 )) | {x x <x 2 Ayi< y 2 ) V {x 1 > x 2 A y x > y 2 ) V (xi = 
x 2 A yi = 7/2)} is a G-invariant equivalence relation on Q 2 . And indeed, G has a 
closed normal subgroup N that is isomorphic to the automorphism group of (Q; <), 
and G/N has two elements, corresponding to the automorphisms that reverse the 
order <, and the automorphisms that preserve the order. □ 



7.3. Oligomorphic Groups 

In the last section we have seen conditions that describe when a topological group 
is the automorphism group of a countable structure. In this section, we see conditions 
that describe when a topological group is the automorphism group of a countable to- 
categorical structure. In fact, Theorem 7.3.1| shows that the information whether a 



countable structure 03 is w-categorical is not only coded into the automorphism group 



of 03 viewed as a permutation group (as we have seen in Section 3.3), but already 
coded into the topological automorphism group of 03. 

A topological group G is called Roelcke precompact if for every open set U C G 
that contains the identity there exists a finite set F C G such that G = UFU . 



Theorem 7.3.1 (of Tsankov 188| ). Let G be a closed subgroup o/Sym(N). Then 
the following are equivalent. 

(1) G is the automorphism group of a countable uj- categorical structure. 

(2) G is Roelcke precompact, and has an open subgroup of infinite index. 

(3) G has a continuous transitive action on an infinite set D with the discrete 
topology, and every such action is oligomorphic. 

Proof. For the implication from (1) to (2), suppose that G is the automorphism 
group of an w-categorical structure 03, and let G be the domain of G, which is a set of 
permutations of the domain B of 03. For every a G B the pointwise stabilizer G a is an 
open subgroup of infinite index in 25. To show that G is Roelcke precompact, let U be 
an open subgroup of G. Then there exists an n such that U contains the stabilizer G a 
for an n-tuple a of elements of B. It clearly suffices to show the existence of a finite 
number of elements g%, . . . ,gk of G such that G = {J i<k G a g;G a . By Theorem 3.1.4 
G has finitely many orbits of 2n-tuples; so let (a, g\ ■ a), . . . , (a, g% • a) be a complete 
list of representatives for those orbits of 2n-tuples that are contained in G ■ a x G ■ a. 
We claim that G a giGa U • • • U G a gkGa = G. Let / e G be arbitrary. Let i < k be 
such that / • a and gi ■ a lie in the same orbit of 71-tuples in G a . So there exists an 
h G G a such that f ■ a = hgi ■ a. Then / _1 o h o g i lies in G a , so / G GagiGa as 
required. 

To show that (2) implies (3), assume (2), and let V be the open subgroup of G 
of infinite index. Since V is open, G/V has the discrete topology, and the natural 
action of G on the infinite set G/V is continuous and transitive. Next we show 
that every continuous transitive action • of G on an infinite set D is oligomorphic. 
We show by induction on n that such an action has only finitely many orbits of n- 
tuples, for all n. This is true by assumption for n = 1. For the induction step, fix 
a — (01, . . . , a n ) G D n , and let c be an arbitrary element from D \ {a\, . . . , a n }. By 
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Roelcke precompactness of G, there exists a finite set {fi, ■ ■ ■ , ft} Q G such that 
G = G-achGac U • • • U G ac f k G- ac . Let B(a) be {ft ■ c, . . . , f k ■ c}. 

Claim 1. For every d € D\ {oi, . . . , a„} there is an /i 6 G a and b £ -B(a) such 
that d — h ■ b. By transitivity of G, there is a g € G so that d = g ■ c, for arbitrary 
d £ D\ {ai, . . . , a„}. Let i, /ii, /12 be such that hi,fi2 £ G ac and g — h\fih-i- Then 
d = gc = hxfih?, ■ c = h\fi ■ c, proving Claim 1. 

Claim 2. When {a 1: . . . , d s } is a complete set of representatives for the orbits of 
n-tuples of the permutation group G, then 

{fab) I i < s,b£ B^)} 

is a complete set of representatives for the orbits of (n + l)-tuples. Let (c, d) £ D n+1 . 
By assumption there exists g £ G such that g ■ di = c. Find h £ G ai and b £ B(di) 
such that g~ x ■ d= h ■ b. Then one has 

gh ■ {di, b)=g - (di, h ■ b) = (c, d) . 

This shows that G has finitely many orbits of (n + l)-tuples, and concludes the 
induction step. 

Finally, we prove the implication from (3) to (1). We assume (3); in particular, 
that there is a continuous action on an infinite set D that has finitely many orbits. 
By (3), the set of permutations of D that is induced by this action is an oligomorphic 
permutation group, and hence by Corollary |3.3.9| is the automorphism group of an 
w-categorical relational structure with domain D. □ 

Note that the groups from Theorem |7.3.1| must always have continuum cardinality; 



this follows from the following and the remarks after Lemma 3.1.10 



Theorem 7.3.2 (Corollary 4.1.5 of [114] ) . Let 05 be a countable structure. Then 
the following are equivalent. 

(1) I Aut(®)| < lj 

(2) j Aut(*B)| < 2" 

(3) There is a finite tuple a in 03 such that Aut((Q3,a)) = 1 

7.4. Bi-interpretations 

When two w-categorical structures share the same topological automorphism 
group, then the relationship between the two structures can be described model- 
theoretically. 

Theorem 7.4.1 (Ahlbrandt and Ziegler (3|). Two uj -categorical structures 03 and 
€. are bi-interpretable if and only i/Aut(03) and Aut(C) are isomorphic as topological 
groups. 

The subgroup of G consisting of the identity element only is called trivial, and 
subgroups of G that are distinct from G are called proper. 

Example 7.4.2. The structures £ := (N 2 ; {(x, y), (u, v) \ x = u}) and D := (N; =) 
are mutually primitive positive interpretable, but not bi-interpretable. To see this, 
observe that Aut(£) has a proper non-trivial closed normal subgroup N such that 



Aut(£)/N is isomorphic to Aut(S) (see Proposition 7.2.6), whereas Aut(S)), the sym- 



metric permutation group of a countably infinite set, has no proper non-trivial closed 



normal subgroups (it has exactly three proper non-trivial normal subgroups 177 



none of which is closed) . □ 
Ahlbrandt and Ziegler also showed the following. 
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Theorem 7.4.3 (From [3j; also see Theorem 5.3.5 and 7.3.7 in 113]) . Let £ 



be an ui- categorical structure with at least two elements. Then a structure 03 has a 
first-order interpretation in £ if and only if 05 is the reduct of a structure 05' such 
that there is a surjective continuous group homomorphism f : Aut(£) — > Aut(03'). 

Several fundamental properties of w-categorical structures 05 are preserved by 
bi-interpretability, and therefore, by Theorem |7.4.1| only depend on the topological 
automorphism group of 05. As we will see in Chapter [8j this is for instance the case 
for the property whether an ordered homogeneous structure has the Ramsey property. 
We list some other properties of this type. 

Definition 7.4.4. Let 03 be an lo- categorical structure. 

• 03 has essentially infinite signature if every structure £ that is interdefinable 
with 03 (equivalently, has the same set of automorphisms as 05, by Proposi- 



tion 3.3.8) has an infinite signature. 
• 03 is homogeneous in a finite signature if 03 is interdefinable with a homo- 
geneous structure 03 of finite (but not necessarily relational) signature. 

The proofs of the following statements are easy and similar to the proof of 



Lemma 5.5.22 and left to the reader. 



Proposition 7.4.5. Let 03 and £ be lo- categorical structures that are first-order 
bi-interpretable. 

• 7/05 has essentially infinite signature, then so has £. 

• If is homogeneous in a finite signature, then so is £. 

• If the number of orbits of n- sub sets in 05 is bounded by 0(2 P ^) for a poly- 
nomial P , then so is the number of orbits of n-subsets in £. 

By Theorem |7.4.1| the three properties in Proposition |7.4.5] only depend on the 
topological automorphism group of 03. But unlike the property of w-categoricity and 
Theorem |7.3.1| we are not aware of any elegant characterization of those properties 
that is directly stated in terms of the topological group. 
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The application of Ramsey theory to study the expressive power of constraint 
languages via polymorphisms is one of the central contributions of this thesis. The idea 
is that polymorphisms must behave in a regular way on large parts of their domain. 
This leads us to decidability results for several meta-questions about the expressive 
power of constraint languages. The same idea can be used to show statements of the 
type 'every polymorphism that violates R must locally generate g\ for certain fixed 
operations g with good properties. Such statements will be crucial in the classification 
projects in Chapters [9] and 10 



In this chapter we first revisit classical concepts and results from structural Ram- 
sey theory in Section |8.1| In order to apply Ramsey theory to analyze polymorphism 
clones, we need the product Ramsey theorem, but also other fundamental facts from 



Ramsey theory, some of which appear to be new (such as Corollary 8.2.12). Those 



facts will be derived from a recently discovered fundamental connection between Ram- 



sey theory and topological dynamics due to Kechris, Pestov, and Todorcevic 126 
This connection allows a more systematic understanding of Ramsey-theoretic princi- 
ples, and we present it in Section [8. 2| The way in which we apply Ramsey theory to 
polymorphisms is described in Section [8.3| We close with an application of our tech- 



nique in Section 8.4 and prove the decidability of various meta-problems concerning 
constraint satisfaction problems, that is, problems where the input is a description 
of a template *B, and the question is whether the corresponding CSP has certain 
properties (for instance, whether certain relations are primitive positive definable). 

Some of the results presented here have been published in |49| ; there is also a 
survey article |47| that additionally covers the applications of our technique for the 
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classification of 'the first-order reducts' of a given homogeneous structure £, that is, 
the structures that arc first-order dcfinblc in £. 



8.1. Ramsey Classes 

This section is about classes C of finite structures that satisfy the following 
Ramsey-type property: for all 21, 05 <E C there exists a £ € C such that when we 
assign finitely many 'colors' to the substructures of £ that are isomorphic to 21, then 
we can find a 'monochromatic' copy of 05 in £, i.e., an induced substructure of £ that 
is isomorphic to 05 and in which all copies of 21 in this substructure have the same 
color. Before we formalize this in detail, we give the classical result of Ramsey, which 
provides a prototype of a class with the Ramsey property. 

From now on, we denote the set {1, . . . , n} also by [n]. Subsets of a set of cardi- 
nality m will be called m-subsets in the following. 

Theorem 8.1.1 (Ramsey's theorem). Let B be a countably infinite set, and let 
m, r be finite integers. When x is a mapping from the m-element subsets of B into [r], 
then there exists an infinite subset P of B such that \ is constant on all m-element 
subsets of P. 



A proof of Theorem 8.1.1 can be found in 114| (Theorem 5.6.1); for a broader 



introduction to Ramsey theory see [99] 

Let ( ) denote the set of all m-subsets of S. We also refer to mappings / : ( ) — ► 
[r] as a coloring of S (with the colors [r]). It is easy to derive the following finite 
version of Ramsey's theorem from Theorem |8. 1 . 1 1 via a compactness argument. 

Theorem 8.1.2 (Finite version of Ramsey's theorem). For all positive integers 
r, m, k there is a positive integer I = R(r, m, k) such that for every \ : (jjj) — > [r] there 
exists a k-subset S of [I] such that \ is constant on ( )■ 

Proof. A proof by contradiction: suppose that there are positive integers r, m, k 
such that for all positive integers I there is a \ : (^) — > [r] such that for all fc-subsets S 
of [I] the mapping \ is not constant on (^) . Since the property that for all fc-subsets S 
of [I] the mapping \ is not constant on f ) is universal first-order, and since the image 
of x is finite, we can apply Lemma |3.1.8| and get the existence of a mapping x with 



the same property but defined on all integers. This contradicts Theorem 8.1.1 □ 



More generally, when 21 and 05 are r-structures, we write Cz) for the set of all 
induced substructures of 05 that are isomorphic to 21. When 21, 05, £ are r-structures, 
and r > 1 is finite, we write 

£^ [<B)f 

if for all x '■ (a) ~> W there exists 05' € (^) such that x is constant on (® ) . 

Definition 8.1.3. A class of relational structures that is closed under isomor- 
phisms and induced substructures is called Ramsey if for every 21, 05 G C and for every 
finite k > 1 there exists a £ € C such that £ — > (05)^. 

Our first example of a Ramsey class is the class of all finite structures over the 



empty signature; this is an immediate consequence of Theorem 8.1.2 We also observe 
the following. Recall that the age of a r-structure 05 is the class of all finite r- 
structures that embed into 05. 



Corollary 8.1.4. The age of (Q; <) is Ramsey. 
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Proof. This is again a direct consequence of Theorem |8.1.2[ since whether or 
not an m-element substructure is isomorphic to an n-element substructure of (Q; <) 
only depends on n and m. □ 

We will now present further examples of Ramsey classes; the proofs are non-trivial 
and fall out of the scope of this thesis, but we provide references. 

Example 8.1.5. The class of all finite Boolean algebras 03 = (B; U, n, c, 0, 1) has 



the Ramsey property. This is explicitly stated in 126 , page 147, line 3ff (see also 
page 112, line 9ff), where it is observed that this follows from a result by Graham 
and Rothschild [98]. □ 

It might be instructive to also see an example of a class of structures that is not 
Ramsey. Typical examples come from classes that contain structures with non-trivial 
automorphism groups, as in the following. 

Example 8.1.6. The class of all finite graphs is not a Ramsey class. To see this, 
let 21 be the (undirected) graph ({0, 1, 2}; {{0, 1}, {1, 2}}) ( since the edge relation is 
symmetric we write edges as 2-element subsets of the vertices) with three vertices and 
two edges, and let 03 be C4, that is, the undirected four-cycle 

({0,1, 2, 3}; {{0,1}, {1,2}, {2, 3}, {3,0}}) . 

Let £ be an arbitrary graph. We want to show that there is a way to color the copies 
of 21 in £ without producing a monochromatic copy of 03. For that, fix an arbitrary 
linear order < on the vertices of £. We define a coloring \ '■ (™) ~~ ► {0, 1} as follows. 
If there is an embedding h of 21 into £ such that h(0) < h(l) < h(2), then we color 
the corresponding copy of 21 in £ with 0; all other copies of 21 in £ are colored by 1. 
We claim that any copy of 03 in £ contains a copy of 21 that is colored by 1, and one 
that is colored by 0. The reason is that for any ordering of the vertices of 03 there is 
an embedding h! of 21 into 03 such that h'(0) < h'(l) < h'(2), and an embedding h" 
of 21 into 03 such that not h"(0) < h"(l) < h"{2). Hence, £ (03)f . □ 

Frequently, a class without the Ramsey property can be made Ramsey by expand- 
ing its members appropriately with a linear ordering. We will see several examples. 



Example 8.1.7. Nesetfil and Rodl 161 and independently Abramson and Har- 



rington 111 showed that for any relational signature r, the class C of all finite or- 
dered r-structures is a Ramsey class. That is, the members of C are finite structures 
21 = (A; <, i?2, • ■ • ) for some fixed signature r = {<, R2, . . . }, where < is a 
total linear order of A. 

A shorter and simpler proof of this substantial result can be found in [162] and 
[159] ; the proof there uses the partite method, which uses amalgamation to reduce 
the statement to proving the so-called partite lemma; the proof of the partite lemma 
relies on the Hales- Jewett theorem from Ramsey theory (see |99| ). □ 

Example 8.1.8. Recall the homogeneous structure 03 = (B; |) carrying a C- 
relation, introduced in Section [4. 1| We consider the expansion of 03 by a linear order, 
defined as follows. It is easy to see that for every finite tree 1 there is an ordering 
< on the leaves L of T such that for all u,v,w £ L with u < v < w we have either 



u\vw or uv\w (recall Definition 4.1.3). This can be seen from the obvious existence 
of an embedding of 1 on the plane so that all leaves lie on a line and none of the 
edges cross, and take the linear order induced by the line. We call such an ordering 
of L compatible with T. The class of all finite substructures £ of 03 expanded by a 
compatible ordering of the underlying tree of £ is a Ramsey class; this follows from 



more general results by Milliken (Theorem 4.3 in 156 , building on work in [78]), 



and has been shown explicitly in 45 . □ 
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Example 8.1.9. The Ramsey classes from Example |8. 1 . 7| have been further gen- 



eralized by Nesetfil and Rodl as follows 161 . Suppose that N is a (not necessarily 



finite) class of structures with finite relational signature r whose Gaifman graph (Def- 
inition 2.1.2) is a clique - such structures have been called irreducible in the Ramsey 
theory literature. It can be readily verified that C := Forb(A/") is an amalgamation 
class. Then the class of all expansions of the structures in C by a linear order is a 
Ramsey class; again, this can been shown by the partite method [162| . This example 
is indeed a generalization of Example |8.1.7| since we obtain the previous result by 
taking TV = 0. □ 

The fact that all the above Ramsey classes could be described as the age of a 
homogeneous structure is not a coincidence. We have the following (the proof is 



from 160 , and presented here for the convenience of the reader). 



Theorem 8.1.10 (of 160 ). Let r be a relational signature, and let C be a class 
of ordered finite r -structures that is closed under induced substructures, isomorphism, 



and has the joint embedding property (see Section 3.2). If C is Ramsey, then it has 
the amalgamation property. 

Proof. Let 21, 03i, 032 be members of C such that 21 is an induced substructure 
of both 03 1 and 032. Since C has the joint embedding property, there exists a structure 
£ €E C with embeddings e\, ei of 03i and 032 into £. If e±, e^ have the same restriction 
to 21, then we are done, so assume otherwise. 

Let S3 be such that 2) — > (£)f- Define a coloring \ '■ (or) {1,2} as follows. 
When 21' € (^), and / : 21 — > 21' is an isomorphism, then x(2t') = 1 if and only if 
there is an embedding h of £ into D such that / = h o e\. 

Since D — > (£)f , there exists £' S (®), witnessed by an embedding h' of £ into 

D such that \ is constant on (£). Now any copy of £ in 2) contains a copy 21' of 21 

with x(2t') = 1- Thus \ is constant 1 on (^). 

Consider the embedding h! o e2 of 21 into 2); as we have seen above, the corre- 
sponding copy of 21 in 2) is colored 1. Thus there exists an embedding h" of £ into 
T) such that / = h" o e% = h' e^ (here we use the assumption that the structure 21 is 
ordered). This shows that D together with the embeddings h" o e\ : B\ — > D and 
h' o e-2 : B2 — > D is the amalgam of 93 1 and 032 over 21. □ 

It is often convenient to work with the Fraisse-limit of a Ramsey class rather than 
the class itself. Indeed, we have the following. 

Proposition 8.1.11. Let C be an amalgamation class, and let £ be the Fraisse- 
limit of C. Then C has the Ramsey property if and only if€ — > (fB)^ 1 for all 21, 23 € C, 
and k > 2. 

Proof. Let 21, 03 e C, and A; > 2 a finite integer. When C is Ramsey, then 
there exists a (finite) £' £ C such that £' — > (03)f , and because £' is an induced 
substructure of £, we also have £ — > (03)^. 

Conversely, suppose that £ — ► (03)^. When k is the cardinality of (®), then for 
any structure 2) the fact that D — > (03)^ can be equivalently expressed in terms of 
r-colorability of a certain /c-uniform hypergraph, defined as follows. Let © = (V; E) 
be the structure whose vertex set V is (5) , and where (2ti ,...,&&) € -E if there exists 
a 03' e (®) such that (®') = {2li, . . . ,2l fc }. Let ij = ([r];£) be the structure where 
E contains all tuples except the tuples (1, . . . , 1), . . . , (r, . . . , r). Then J) — > (03)^ if 
and only if 25 homomorphically maps to fj . By Lemma |3.1.5| this is the case if and 
only if all finite substructures of & homomorphically map to Sj. Thus, £ — > (03)^ if 
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and only if €' — > for all finite substructure £' of C, and we conclude that C is 

Ramsey. □ 

When 25 is a homogeneous structure with a finite relational signature whose age 
is a Ramsey class, then this fact is useful for studying which relations are primitive 
positive definable in 05, as we will see for instance in Section [8^4} In fact, for those 
applications of Ramsey theory it suffices that 25 can be expanded to a structure €. 
that is homogeneous, Ramsey, and has a finite relational signature. We make the 
following conjecture. 

Conjecture 8.1. Let % be a homogeneous structure with a finite relational sig- 
nature. Then there is a homogeneous expansion £ o/05 with finite relational signature 
whose age has the Ramsey property. 

8.2. Extremely Amenable Groups 

This section presents a link between Ramsey classes and Polish groups that are 
extremely amenable. The link rests on the theorem of Kechris, Pestov, and Todorcevic 
that characterizes those ordered homogeneous structures that are Ramsey in terms 
of their topological automorphism group, and will be presented in Section |8.2.1| In 
Section |8.2.2[ |3.3.4| and |8.2.4[ we use this result to obtain a more systematic under- 
standing of Ramsey classes. 

8.2.1. Extreme Amenability. The Ramsey property for ordered homogeneous 
structures 05 has an elegant characterization in terms of the topological automorphism 
group of 55: the age of 05 is Ramsey if and only if the topological automorphism group 
of 05 is extremely amenable. Extreme amenability is a concept from group theory 



studied since the 60s 100 



Definition 8.2.1. A topological group is extremely amenable iff any continuous 
action of the group on a compact Hausdorff space has a fixed point. 

The following is the combination of Proposition 4.2, Proposition 4.3, Theorem 4.5, 
and Theorem 4.7 from [126| . 

Theorem 8.2.2 (Kechris, Pestov, Todorcevic 126| ). Let 05 be a homogeneous 



relational structure, and let G be the topological automorphism group of 05 . Then the 
following are equivalent. 

(1) The age of 05 has the Ramsey property, and G preserves a linear order < 
on B. 

(2) The age of 05 only contains rigid structures, and has the Ramsey property. 

(3) (a) For any finite subset F of B, the substructure induced by F in 05 is 
rigid, and (b) for all orbits 0\,02 of finite subsets in 05, and for every 
X ■ 0\ — > [r] there is an i < r and an F G O2 such that x{F') — * f or a ^ 
F' CF where F' € Ox. 

(4) For any open subgroup V of G, every x '■ G/V — > [7'], and every finite 
A C G/V there is g € G and 1 < i < r such that x(9 a ) — * f or aZZ a € A. 

(5) G is extremely amenable. 

For structures 05 that are not homogeneous but w-categorical, the equivalence 
between (3), (4), and (5) remains valid, since every w-categorical structure has a 
homogeneous expansion by first-order definable relations - and such an expansion 
has the same automorphism group as 05; we can then apply Theorem |8.2.2| to the 
expansion. It therefore makes sense to call an w-categorical structure 05 Ramsey if 
the age of the expansion of 05 by all first-order definable relations is Ramsey. There are 



also interesting applications of Theorem 8.2.2 when 05 is not w-categorical, see 126 . 
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We point out a remarkable consequence of Theorem |8.2.2| in combination with 
Proposition |3.3.8| 

Corollary 8.2.3. Suppose that 03 is an uj- categorical Ramsey structure where 
all finite induced substructures are rigid. Then a linear order is first-order definable 
in 03. 

When a Ramsey structure 03 has substructures with non-trivial automorphisms, 
Theorem |8.2.2| is therefore not directly applicable. For the applications of Ramsey 
theory we have in mind, though, it is sufficient to know that 03 has a first-order 
definition in an ordered Ramsey structure that is homogeneous in a finite relational 



signature (see Section 8.3 and 8.4). 

The choice of the order is not arbitrary, but plays an important role when we 
want to preserve the Ramsey property. To give another example, consider again 
the countable atomless Boolean algebra, which is an example of an w-categorical 



structure that is not homogeneous in a finite relational signature (Corollary 4.5.8) 



In this case an order expansion with an extremely amenable automorphism group has 
been specified in 126 , and can be found below. 



Example 8.2.4. Let 03 = (B; U, n, c, 0, 1) be a finite Boolean algebra and A its 
set of atoms (see Example 8.1.5 in Section 4.3). Then every ordering a\ < ■■■ < a n 
of A gives an ordering of B as follows (we follow 126| ). For x, y g B, we set x < y 
if there exists an io G {I, . . . , n} such that 

• for all i £E {1, . . . , io — 1} we have that o, n i = o, n y, and 

• x l~l ai Q — and y D a,; ^ 0. 

Such an ordering of the elements of 03 is called a natural ordering. It can be shown that 
the class C of all naturally ordered finite atomless Boolean algebras has the Ramsey 
property (see the comments preceding Theorem 6.14 in 126 , and Proposition 5.6 
m | 126| ). By Theor em 8.I.I0| C is an amalgamation class. The reduct of the Fraisse- 



limit of C with signature {U,n,c, 0,1} is the atomless Boolean algebra (Propositions 
5.2 and 6.13 in 126 ), so we indeed found an extremely amenable order expansion of 
the atomless Boolean algebra. □ 



The main focus of the article by Kechris, Pestov, and Todorcevic 126 is the 
application of Theorem |8.2.2| to prove that certain groups are extremely amenable, 
using known and deep Ramsey results. Here we are rather interested in the opposite 
direction: we are applying Theorem 8.2.2 in the following sections to obtain a more 



systematic understanding of which classes of structures have the Ramsey property 

8.2.2. Surjective continuous homomorphisms. Interestingly, whether an oj- 
categorical structure 03 is Ramsey only depends on the automorphism group of 03 
viewed as a topological group. For this observation we do not need the full power of 
Theorem |8.2.2 the equivalence of (1) and (3) suffices. More generally, we have the 



following. 

Proposition 8.2.5. Let G be an extremely amenable group, and let H another 
Polish group. If there is a surjective continuous homomorphism f : G — > H ; then H 
is also extremely amenable. 

Proof. Let a:JJxS->Sbean action of H on a compact Hausdorff space 
S. Then b : G x S — > S given by (g, s) i-> a(f(g),s) is a continuous action of G 
on S. Since G is extremely amenable, b has a fixed point sq. Now let h € H be 
arbitrary. By surjectivity, there is a g £ G such that f(g) = h. Then we have 
sq = b(g, sq) = &{f{g), s a) = a{h, sq), and hence sq is also a fixed point under a. □ 
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Example 8.2.6. Recall Example |3.1.11| where we introduced an w-categorical 
structure 21 with binary relations and a two-dimensional first-order interpretation 
over (Q; <) - the corresponding relation algebra is also called Allen's Interval Algebra 



(see Section 1.5.1 1 



It follows from the proof of Theorem 5.5.24 that 21 is first-order bi-interpretable 



with (Q; <). Since the automorphism group of (Q; <) is extremely amenable, Theo- 
rem [8]2]2] therefore shows that the automorphism group of 21 is extremely amenable, 
and that 21 is Ramsey. Since 21 is homogeneous, we also have the Ramsey property 
for the age of 21. □ 



Unfortunately, Theorem 8.2.2 leaves Conjecture |8.1| unresolved. An important 



variant of it has an interesting reformulation in terms of topological groups. 

Conjecture 8.2. Every closed oligomorphic permutation group contains a closed 
oligomorphic subgroup that is extremely amenable. Equivalently, every u- categorical 
structure has an ordered uj- categorical expansion that is Ramsey. 

8.2.3. Products. In this section we present an important tool to built new 



extremely amenable groups, Theorem 8.2.7 below. Recall the definition of direct 



products of two groups from Section 3.3.4 The direct product of two topological 



groups Gi and G2 is the direct product of the respective abstract groups, together 
with the product topology on the group elements. 



Theorem 8.2.7 (Proposition 6.7 in 126| ). Let G be a topological group 



(1) Let N a normal subgroup ofG. If both N and G/N are extremely amenable, 
then so is G. 

(2) When G is a finite direct product of extremely amenable groups, then G is 
extremely amenable. 



Item (1) has been stated in 126 under the additional assumption that N is 
closed; however, as we see in the proof, this assumption is not necessary. Item (2) 
can be generalized to infinite products of groups, but we do not need this here, and 



refer to 126 instead. 



Proof. We first show (1). Suppose G acts continuously on a compact Hausdorff 
space S. Let Sn be the subspace of S induced on {x G S | h ■ x — x for all h G N}, 
which is clearly also Hausdorff. Moreover, it is closed. To see this, let x G S and 
/ G N be such that f-x^x. Then there exists an open set V(x,f) C S that contains 
x such that / • y 7^ y for all y G V(x, /). Otherwise, there exists a sequence (x n ) n >i 
with x n — > x such that f(x n ) — x n for all n > 1. But / • x = f ■ lima;„ = lim/ • x n = 
lim contradiction. Then U/eA? xes x ) defines the complement of S'n 

and is open. So Sn is closed, and also compact since S is compact and closed subsets 
of compact spaces are compact. 

As N is extremely amenable, Sn is non-empty. The set Sn is preserved by the 
action of G on S: when x G Sn and g G G, then for any h G N 

h - (g ■ x) = hg ■ x = g ■ (g~ 1 hg) ■ x = g ■ x 

(where the last equality is by normality of TV and Proposition |7.2.5 ) , and so g-x G S'n- 



Now, consider the action of G/N on S'n defined by (gN) ■ x = g-x, which is 



clearly well-defined. To verify continuity of this action with Proposition 7.1.2 let 
(g n N, s n ) — > (gN, s). Then (g n N) ■ s n = g n ■ s n — > g ■ s = (gN)s since the action of G 
on S is continuous. 

By extreme amenability of G/N there is a point p G S'n such that / • p = p 
for all / G G/N. But then, p is also a fixed point for the action of G on S, since 
9 ' P = (9^) • p = P for any g G G. 
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(2) follows from (1): suppose that G = Hi x H 2 , and that Hi and H 2 are 
extremely amenable. Then Hi is a normal subgroup of G, and G/Hi is isomorphic 
to H2, so G is extremely amenable by (1). The statement for Hi x • • • x H„ follows 
by induction on n. □ 

When we later apply Ramsey theory to analyze polymorphism clones in Sec- 
tion |8.3| of this chapter, and when the polymorphisms are higher-ary, it will be crucial 
to apply the so-called product Ramsey theorem. For every ordered Ramsey class there 
is a corresponding product Ramsey theorem (which usually has various slightly dif- 
ferent formulations). This can be shown either directly, or by applying the general 
results from topological dynamics. 

For illustration, we give a direct proof for the class of all finite linear orders 



(Theorem 8.2.8 which will be used extensively for d = m = 2 in Chapter 10 1. For 
concreteness, we give specialized terminology in this case. If Si, . . . , Sd are sets, we 
call a set of the form Si X • • • X Sd a grid, and also write S d for a product of the 
form S x • • • x S with d factors. A [k] d -subgrid of a grid Si x • • • x Sd is a subset of 
S\ X • • • X Sd of the form S[ X • • • X S' d , where S[ is a fc-element subset of Si. 

Theorem 8.2.8 (Product Ramsey Theorem). For all positive integers d, r, m, 
and k > m, there is a positive integer L — R(rf, r, m, k) such that for every coloring 
of the [m] d subgrids of [L] d with r colors there exists a monochromatic [k] d subgrid G 
of [L] d , i.e., G is such that all its [m] subgrids have the same color. 

Proof. Let d, r, m, and k > m be positive integers. We claim that we can 
choose L = R(g?, r, m, k) to be R(r, dm, dk). To verify this, let x be a coloring of the 
[m] d subgrids of [L] d with r colors. We have to find a monochromatic subgrid of [L] d . 

We use \ to define an r-coloring £ of the dm-subsets of [L] as follows. Let 
S = {si, S2, ■ ■ ■ , Sdm} be a dm-subset of [L], with si < s 2 < • • • < Sd m . Then define 

= X({ S 1> • ■ • , S m } X • • • X {S m (d_l)+l, . . . , S dm }) ■ 

By Theorem |8. 1.2 there is a dfc-subset {ti,t2, ■ ■ ■ ,tdk} °f [L] such that £ is constant 



on the dm-element subsets of {ii, . . . , i^fc}. Suppose that ti < t 2 < ■ ■ ■ < tdk- Then 
G = {ti, . . . , tk} x ■ ■ ■ x {ifc(<j_i)+i, • ■ • , tdk} is a subgrid of [L] d that is monochromatic 
with respect to x- D 

We next present a formulation of the product Ramsey theorem for arbitrary 
ordered Ramsey structures. The proof uses Theorem |8.2.2 and Theorem |8.2.7| 



Theorem 8.2.9. Let *Bi, . . . , 03^ be uj- categorical ordered Ramsey structures. Then 
«P := 53 1 M ■ ■ ■ M «8 d is Ramsey. 

PROOF. By Theorem [8. 2. 2 [ the automorphism groups Gi, . . . , G d of Q3i, . . . , 23 <j 
are extremely amenable, and it suffices to show that the automorphism group G of CP 
is extremely amenable. The group G is given by the product action of Gi x • • • x G^ 



on Bi, . . . ,Bd (see Section 3.3.4.31. Hence, extreme amenability of G follows from 



Theorem 15X71 □ 

Thcorcm |8 . 2 . 9| indeed generalizes Theorem |8.2.8[ which can be seen as follows. Let 
r, d, m, k be positive integers. We consider the w-categorical ordered Ramsey structure 



(Q; <), and apply Theorem 8.2.9 where d in Theorem 8.2.9 equals the d given above. 
Let 21 be the structure induced in *}3 :— (Q; <)I d l by some (equivalently, every) [m] d 
subgrid of Q d , and let 03 be the structure induced in *p by some (equivalently, every) 
[k] d subgrid of Q d . Since £ is Ramsey, there exists an induced substructure £ of 03 
such that £ — > (03)^. If £ is not induced by an [L] d subgrid of Q d , for some large 
enough L, we can clearly choose a larger substructure £ with this property, such that 
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still <£ — > (03)®. The occurrences of 21 in £ correspond precisely to the [to] d -subgrids 
of [L] d , which proves the claim. 

8.2.4. Open Subgroups. In this section we show that open subgoups of ex- 
tremely amenable groups are again extremely amenable. This fact will be of great 
importance in Section |8.3.4| and |8.4| when it comes to the applications for analyzing 
polymorphism clones. We first show the following basic fact. 

Proposition 8.2.10. Let X be a topological space, and Y be any set. Let G be a 
topological group that acts on X Y . Then the action is continuous if for every y £ Y, 
the map f y :Gx X Y — > X given by /y (<?,£) := id ' 0(v) * s continuous. 

PROOF. Suppose that ( g w ,g w ) -> (g,£). Then g n -> g and £ n (y) -» £(y ) for all 
y 6 Y", by Proposition 7.1.3 Since f y is continuous and by Proposition 7.1. 2| 



for all y £ Y. We again apply Proposition 7.1.3| and obtain that (<7„ • x n ) — > g ■ £, 



which implies continuity of the action of G, again using Proposition 1 7. 1.2 1 □ 

Proposition 8.2.11 (from |49 ). Let G be an extremely amenable group, and let 
H be an open subgroup of G. Then H is also extremely amenable. 

In the proof, it is not essential but technically more convenient to use right cosets 
instead of left cosets. 

Proof. Let H act continuously on a compact space X; we will show that this 
action has a fixed point. Denote by it : G — >• H\G the quotient map and let s : H\G — > 
G be a section for n (i.e., a mapping satisfying n o s = id) such that s(H) = 1. Let 
a be the map from H\G x G — >• H defined by 

a(w,g) = s(w)gs(wgy 1 . 

For w £ H\G and g £ G, note that s(w)g and s(wg) lie in the same right coset of 
H, namely wg, and hence the image of a is H. The map a satisfies 

u(w,g 1 g 2 ) = s(w)g 1 g 2 {s(wg 1 g 2 ))~ 1 

= s{w)g 1 s(wg 1 )s(wg 1 )~ 1 g 2 (s(wg 1 g 2 )y 1 
= a(w,g 1 )a(wg 1 ,g 2 ) . 

As H is open, H\G is discrete. Hence, s is continuous, and therefore a is contin- 
uous as a composition of continuous maps. 

Now co nsider the product space X H \ G which is Hausdorff and compact by The- 
The co-induced action of G on X H \ G is defined by 



orem 



7.1.4 



(9 ■ 0( w ) = a(w,g) ■ £(wg). 

We claim that this action is continuous. By Proposition |8.2.10[ it suffices to verify 
that the map (g, £) i-> (g ■ £)( w ) is a continuous map from G x X H \ G —> X for every 
fixed w e H\G. We already know that a is continuous and that the action of H on 
X is continuous. To see that (g,£) *-> £(wg) is continuous, suppose that (g n ,£n) 
(<?,£)■ Let w — Hk. As g n — > g and k^ 1 Hk is open, we will have that eventually 
9n9~ X £ k~ 1 Hk 1 giving that kg n {kg)~ l € H, or, which is the same, Hkg n = Hkg. 
We obtain that for sufficiently large n, wg n — wg. Therefore £, n {wg n ) — > £,(wg). 

By the extreme amenability of G, the co- induced action has a fixed point £o- 
Now we check that £,o(H) £ X is a fixed point of the action H rv X. Indeed, for any 
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h € H, h ■ £o = £o and we have 

=a(H,h)>{o(Hh) 
=s(H)hs(Hh)-%(H) 

finishing the proof. □ 



Proposition 8.2.11 can be applied to provide a short and elegant proof of the 
following. We do not know of any combinatorial proof of this fact at the same level 
of generality. 



Corollary 8.2.12 (from 49 ). Let 23 be ordered homogeneous Ramsey, and let 
c\, c n be elements o/Q3. Then (03, ci, . . . , c n ) is ordered homogeneous Ramsey 
as well. 

Proof. It is easy to see that the expansion of any homogeneous structure 23 by 
finitely many constants is again homogeneous. When 23 is additionally ordered Ram- 
sey, then Aut(23) is extremely amenable. The automorphism group of (23, c\, . . . , c n ) 
is an open subgroup of Aut(23). The statement thus follows directly from Proposi- 
tion 18.2.111 and Theorem KT2\ □ 

8.3. Canonization 

In this section we apply Ramsey theory to analyze polymorphism clones of Ram- 
sey structures 23. The central idea is that any mapping from B k — » B must 'behave 
canonically' on copies of arbitrary finite substructures of 23. We first consider the 
more general case of functions between two possibly distinct structures, and intro- 
duce a refinement of the notion of canonicity from Section |5.6.2| 

Definition 8.3.1. Let £ be a structure with domain C , and S a subset of C . Let 
23 be a structure with domain B, and let f: C —> B be a function. We say that f 
is canonical on S as a map from £ to 23 if for all n and every n-tuple t over S the 
n-type of f(t) in 23 only depends on the n-type oft in £. 

The basic lemma to apply Ramsey theory in the analysis of functions is the 
following. 

Lemma 8.3.2. Let £ be an uj- categorical ordered Ramsey structure with domain 
C and finite signature t, let 23 be an lo- categorical structure with domain B , and let 
f : C — > B be an operation. Then for all finite subsets S of C there is an automorphism 
a of £ so that the operation x f(ax) is canonical on S as a map from £ to 23. 

Proof. Let m be the maximal arity of the relations in r. Let 6 be the substruc- 
ture induced by S in £, and n :— \S\. Let < be the linear order in the signature of £ 
When 21 is a substructure of £ of size m, then a a is the tuple (oi, . . . , a m ) such that 
{oi, . . . , a m } are the elements of 21 and t± < ■ ■ ■ < t m . 

Let 2li, . . . , 2lfc be a list all non- isomorphic substructures of © of cardinality m. 
Since £ is Ramsey, there is a substructure Ci of € such that €i — > (S)^ 1 - Further, 
there is a substructure £2 of £ such that £2 — > (£i)^ 2 . We iterate this k times, 
arriving at a structure £&. For each i < k, the operation / defines a coloring \i °f 
) w ith finitely many colors as follows: the color of a copy 21 of 2li is just the type 
of /(a a ) in 23; since 23 is w-categorical, the number r of m-types in 23 is finite. 

Now going back the argument, we find that £^ contains a copy of & on which all 
colorings Xi> • • • >Xk are constant. Since £ is w-categorical, there exists an automor- 
phism a of £ that sends S to this copy. Then x 1— > f{ax) is canonical on S. □ 
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Note that the assumption that 03 is ordered is necessary in Lemma |8.3.2| for 
instance, if / is an injective function from X — > (Q; <), then / is not canonical as a 
map from (X; =) to (Q; <) on any two-element subset of X. 

8.3.1. Higher-ary functions. The appropriate generalization of canonicity for 
higher-ary functions is the following. 

Definition 8.3.3. Let 03 be a structure with domain B, and £ a structure with 
domain C . When f : C d —> B is a function, and S is a subset of C d we say that f 
is canonical on S if for all n and all n-tuples t 1 , . . . , t d where (tj, . . . ,tf) £ S for all 
i < n the n-type of fit 1 , . . . , t d ) in 03 only depends on the n-types of t 1 , . . . ,t d in €.. 
We say that f is canonical (as a d-ary map from £ to 03) if f is canonical on B d . 

When 03 = £ in the definition above, we say that / is canonical on 03 when it is 
canonical as a d-ary map from 03 to 03. 

Example 8.3.4. Let lexbe a binary operation on Q such that lex(a, b) < lex(a' , V) 
if either a < a! , or a = a' and b < b' . Clearly, such an operation exists. Note that lex 
is injective, that it preserves <, and that it is canonical as a binary polymorphism of 

(Q;<). 

In the proof of the following we use the product Ramsey theorem, Theorem |8.2.9| 

Theorem 8.3.5. Let 03 be an uj- categorical ordered Ramsey structure with domain 
B, and let f : B d — > B be any operation. Then for all finite subsets Si, . . . , Sd of 
B there are automorphisms aj. , . . . , a<j of 03 so that the operation (xi,...,Xd) > 
f(ctiXi, . . . , otdXd) is canonical on Si x • • ■ x Sd- 



Proof. By Theorem 8.2.9 the structure 03^1 is Ramsey. Hence, Lemma 8.3.2 
shows the existence of an automorphism a of 93^ such that x t— > f{ax) is canonical 
on Si x • • • x Sd as a map from 03^ to 03. 

Let G be the topological automorphism group of 03. Since the automorphism 
group of 03^1 is induced by the product action of G k on B k , there are group elements 
<zi, . . . ,dd of 25 so that a(xi, . . . , Xd)) = (aiXi, . . . , a>dXd)- Now clearly the function 
(xi, . . . , Xd) i-> f(otiX\, . . . , adXd) is canonical on S% x • • • x Sd as a higher-ary function 
on 03. □ 

8.3.2. Interpolation modulo automorphisms. One of the central questions 
when analyzing a polymorphism of a structure 03 is to find out what functions it 



generates (since those functions will also be polymorphisms of 03, see Section 5.2) 



Theorem |8 . 3 . 5 1 can be used for this purpose; to illustrate this, we present the following. 

Corollary 8.3.6. Let 03 be an uj- categorical ordered Ramsey structure. Then 
every injective operation f : B k — > B together with the automorphisms of 03 locally 
generates a canonical injective operation g. 

This corollary follows in a straightforward way from Theorem |8.3.5| and a com- 
pactness argument, which we do not present here since we will present a proper 



generalization of it in full detail, Theorem 8.3.8 



Note that when we drop the injectivity assumption for / in the statement of 
Corollary |8 . 3. 6| then the statement of the corollary becomes trivially true, since every 
operation locally generates the projections, which are canonical on the entire domain. 
We therefore need a concept that is weaker than local closure, but stronger than 
interpolation, to turn the idea of Corollary |8.3.6| into a meaningful statement for all 
functions from B k to B. 
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Definition 8.3.7. Let 03 be an u>- categorical structure with domain B, and 
f,g: B d — > B be functions. Then f interpolates g modulo automorphisms of 03 if for 
every finite S C B there are automorphisms a\, . . . ,otd of^& such that g(x\, . . . ,x c i) = 
f{a\X\, . . . , a d x d )- 

Theorem 8.3.8. Let 03 be an ordered uj- categorical Ramsey structure with domain 
B, and f : B d — > B any operation. Then there is a canonical operation g : B d — > B 
that is interpolated by f modulo automorphisms. 



Theorem 8.3.8| is still not in its most general and most useful form. For this, we 
need a further generalization of the notion of interpolation modulo automorphisms to 
the situation where / is a function from a structure £ to a different structure 03. 

Definition 8.3.9. Let 03, £ be structures with domains B and C , and let /, g: C — > 
B be functions. Then f interpolates g modulo automorphisms of £ if for every finite 
SCC there is an a € Aut(£) such that g(x) = f(ax). 

The following is the central statement about Ramsey structures and interpolation 
modulo automorphisms. 

Theorem 8.3.10. Let £ be uj- categorical ordered Ramsey with domain C , and let 
03 be uj- categorical with domain B. Then every f : C — > B interpolates a canonical 
operation g: C — > B modulo automorphisms of £. 



Proof. By Lemma 3.1.8 it suffices to show that for every finite subset C of C 
there is a function from C — >• B that is canonical on C" and interpolated by / modulo 
automorphisms of £, since the property to be canonical on C is a universal first-order 
statement about /. This follows from Lemma 8.3.2 □ 



Proof of Theorem 18.3.81 We apply Theorem |8. 3.10| to the structure £ 



03 I d l , wh ich is Ramsey when 03 is Ramsey, by Theorem |8.2.9| As in the proof of 
Theorem 8.3.5 canonicity of a function g from 03^1 to 03 translate into canonicity of 
j as a d-axy function on 03, and interpolation operations modulo automorphisms of 
03 " and 03 corresponds to interpolation of d-axy functions modulo automorphisms of 
03. □ 



Note that Theorem |8.3.8| is indeed a generalization of Corollary |8.3.6[ since clearly 
operations that are interpolated by injective operations modulo automorphisms are 
again injective. 



'Canonization' of operations as exhibited in Theorem 8.3.10 becomes particularly 
powerful when we combine it with expansions by constants. The following theorem 
has numerous applications. 

Theorem 8.3.11 (fom |49| ). Let 03 be an to -categorical ordered Ramsey structure 
with domain B and finite relational signature. Let ci, . . . , c m € B, and let f ' : B k -> B 
be any function. Then {/} U Aut((03, Ci, . . . , c m )) locally generates a function which 
is canonical as a function from (03, c\, . . . , c m ) k to 03, and which is identical with f 
on all tuples containing only values Ci. 



sey. 



PROOF. By C orollar y 
By Theorem |S.2.9 



.2.12 the structure (03, ci,. 

(03, ci,. 



) is also ordered Ram- 
is ordered Ramsey 



the structure £ := (W, ci, . . . , c 
(and still w-categorical) . Then Theorem 8.3. 10| shows that / interpolates modulo 
automorphisms of £ a canonical operation g. In particular, g is locally generated 
by {/} U Aut((03, c%, . . . , c m )), and the restrictions of / and g to {ci, . . . , c m } are 
equal. □ 
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8.3.3. Behavior of Operations. It is sometimes important to work with op- 
erations that exhibit a 'behavior' that is only partially canonical. The following 



definition from 47 gives us some flexibility in specifying such functions. 



Definition 8.3.12. Let £ and 58 be structures with domains C and B, and let 
k > 1. An (n-)type condition between £ and 58 is a k + 1-tuple (i 1 , . . . , t d , s), where 
each ti is an n-type in £, and s is an n-type in 58. A d-ary function f : C d — > B 
satisfies an n-type condition (t 1 , . . . , t d , s) on S C C d if for all n-tuples a 1 of type t l 
in £ with (a\, . . . , a l d ) £ S for all i < d, the n-tuple (f{a\, . . . , af ), . . . , f(a^, ■ ■ ■ , a d )) 
is of type s in 58 . 

A behavior between two structures £ and 58 is a set A of type conditions. A 
function f : C — > B has behavior A on S C C d if it satisfies all the type conditions 
of A on S. We say that f has behavior A if it has behavior A on all of C d . 

Note that a d-ary operation /: £ d — > 58 is canonical if for all n > 1 and all 
d-tuples (£,..., t d ) of types of n-tuples in £ there exists a type s of an n-tuple in 
58 such that / satisfies the type condition (t 1 ,... ,t d ,s). When 58 is homogeneous 
in a relational signature with maximal arity n, then already the n-type conditions 
determine the behavior of functions over 58. 

When 58 is w-categorical then the clone generated by Aut(58) and a canonical 
d-ary function / over 58 is completely described by the behavior of /. In fact, when 
f,g: B d —> B are functions with the same behavior, then {/} U Aut(58) generates g, 
and {g} U Aut(58) generates /, by local closure. 

Lemma 8.3.13. Let 58 be ordered ui -categorical with domain B. Let A be a behavior 
for functions from 58 fe to 58, and let g: B k — > B be arbitrary. If for every finite 
substructure 521 of 58 there are copies 21 1 , . . . , 21& o/ 21 in 58 such that g has behavior 
A on 21 1 x • ■ • x 52lfe then {g} U Aut(58) locally generates a function f of behavior A. 



Proof. A direct consequence of Lemma 3.1.8 □ 



An orbit of (58, ci, . . . , c m ) is called full if it contains copies of all finite substruc- 
tures of 58. The following follows from Lemma 8.3.13 As we will see in Chapter [9] 



and Chapter [TOl it becomes important in the context of canonization after expansions 



by constants (Theorem 8.3.111 



Lemma 8.3.14. Let 58 be lj -categorical, and c±, . . . ,c m be elements from 58. When 
521 is the substructure induced in 58 by a full orbit O of (58, ci, . . . ,c m ), and f is a 
function from (58, ci, . . . , c m ) k to 58 with behavior A on O, then {/} U Aut(58) locally 
generates a function from 58 k to 58 with behavior A. 

Not every behavior A between £ and 58 is realized by a function in the sense 
that there exists a function from C — > B that has behavior A. We give an example. 
There are eight distinct candidates for canonical behavior of injective maps from 



(Q; <) 2 to (Q;<); they are illustrated in Figure 8.1 However, only four of those 
canonical behaviors are realized by binary injective polymorphisms of (Q; <); those 
are illustrated in Figure |8.2| The others would imply the existence of three points 
x, y, z in the image such that x < y and y < z and z < x, which is impossible over 
(Q; <). The not necessarily injective case can be analyzed similarly, and we get the 
following. 

Lemma 8.3.15. Let f be a canonical binary polymorphism o/(Q; <). Then f has 
the same behavior as one out of the following seven operations. 

• lex{x,y) or lex(y,x); 

• lex(x,—y) orlex{y,—x). 
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Figure 8.1. Canonical behavior on [2] 2 grids. 





Figure 8.2. When / is a canonical binary injective polymorphism 
of (Q; <), then there is one linear order of a [2] 2 grid as depicted here 
such that all [2] 2 subgrids of Q 2 are linearly ordered in this way. 



• (x,y) H> x or {x,y) h 

• a constant operation 



Together with Corollary |8. 3. 6[ we find that every binary injective polymorphism 
of (Q; <) locally generates lex(x,y), lex(y,x), lex(x, —y), or lex(y, —x). 

8.3.4. Canonical Violation. Let £ be an w-categorical ordered Ramsey struc- 
ture, and let 03 be a structure with a first-order definition in €. Suppose that a 
relation R does not have a primitive positive definition in 03. We wish to show that 
then there exists a polymorphism of 03 that violates R and is canonical as a func- 
tion over £. Boldly stated like this, this cannot hold true. However, the results 
from Section [8.2. 4| show us how to fix the statement. To illustrate the basic idea, we 
first discuss the unary case, with existential positive definability instead of primitive 
positive definability. 

Theorem 8.3.16. Let £ be an u>- categorical ordered Ramsey structure, and let 03 
be a structure with a first-order definition in £. Suppose that the k-ary relation R does 
not have an existential positive definition in 03 . Then there exists an endomorphism 
e of '03 and a k-tuple t — (t%, . . . , tk) £ R such that 

• e(t) <£ R 

• e is canonical as a map from (<£, t\, . . . , £&) to £. 

Proof. The structure 03 is w-categorical. If R does not have an existential 
definition, then by Theorem |3.4.7| there is an endomorphism e' of 03 which violates R, 
that is, there is a fc-tuplc t = (ti, . . . , t k ) £ R such that e'(t) R. By Theorem 8.3.11 
e'U{Aut(03, t%, . . . , ifc)} locally generates an operation e that is canonical as a function 
from (03, t\, . . . , tk) to 03 that has the same restriction to {t\, . . . , t^} as e', and e has 
the required properties from the statement of the theorem. □ 
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Here comes the higher- ary analog of Theorem |8.3.16| whose proof is analogous to 
the proof of the previous theorem. 

Theorem 8.3.17. Let €. be an lj- categorical ordered Ramsey structure, let % be a 
structure with a first- order definition in £, and suppose that the k-ary relation R does 
not have a primitive positive definition in 03. Then there exists a finite d, a d-ary 
polymorphism f offe, and k-tuples t 1 , . . . ,t d £ R such that 

. f{t\....t d )£R 

• f is canonical as a map from (£, t 1 ) El ■ ■ ■ E (£, t d ) to €.. 
Proof. If R does not have a primitive positive definition in 03, then since 03 



is w-categorical, by Theorem 5.2.3 there is a polymorphism /' of 03 which violates 
R. By Lemma 5.3.5 we can assume that the arity of /' equals the number d of 



orbits of fc-tuples contained in R, which is bounded by o g ( fc). So there are fc-tuples 
t\...,t d £ R such that f'(t\ . . . , t m ) £ R By Corollary 



5.2.12 



for all z < d the 

structure (£,*') is Ramsey, and by Theorem |8.2.9| the structure (C,^) E ••• E (£,t d ) 



is Ramsey. Then Theorem 8.3.10 shows that /' interpolates modulo automorphisms 
of (£, t 1 ) E ■ ■ ■ E (£, t d ) a canonical operation /, and f{t x , ...,t d ) = fit 1 , ...,t d ) £ 
R. Since / is in particular locally generated by polymorphisms of 03, it is itself an 
polymorphism of 03. □ 

We are now in the situation to prove the following, which has been announced 
already in Section 5.3.5| 



Theorem 8.3.18 (from 49 ). Let 03 be a structure with finite relational signature, 
and with a first-order definition in an ordered homogeneous Ramsey structure £ with a 
relational signature of maximal arity k. Then there are finitely many minimal closed 
clones above Pol(03). 

Proof. Every minimal closed clone above Pol (03) is locally generated by a mini- 
mal operation / (Proposition 5.3.14 ), and by Theorem 5.2.4 there must be a relation R 



in 03 that is violated by /, that is, there are t 1 , . . . , t d g R such that fit 1 , . . . , t d ) £ R. 
Since / is a minimal operation, Theorem |8 . 3 . 1 7| implies that / must be canonical as a 
map from (<£, t 1 ) El • • • El (<£, t d ) — > £. But since £ is homogeneous is a finite relational 
signature, and 03 has finite relational signature, there are only finitely many canonical 
behaviors of such operations; since two minimal operations with the same behavior 
locally generate the same closed clone above Pol(03), we are done. □ 



8.4. Decidability Results for Meta-Problems 

We turn to another application of the ideas of the previous sections. For a fixed 
structure £ with a finite relational signature r and domain C, consider the following 
computational problem. 

Expr-fo(£) 

INSTANCE: Quantifier-free first-order r-formulas <fio, . . . , cj) n defining the relations 
Ro , . . . , R n over £. 

QUESTION: Is there a first-order definition of R in (C; Ri, . . . , Bn)1 

We are also interested in the variants of this problem where we replace first-order 
definability by other syntactically restricted versions of definability, in particular by 
primitive positive definability. The corresponding computational problem for prim- 
itive positive definability is denoted by Expr-pp(C) (and the problem for existential 
and existential positive definability by Expr-ex(C) and Expr-ep(C), respectively). 
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For finite structures € the problem Expr-pp(C) is in co-NEXPTIME (and in 
particular decidable). For the variant where the finite structure T is part of the 
input, the problem has recently shown to be also co-NEXPTIME-hard |189| . An 
algorithm for Expr-pp(£) has theoretical and practical consequences in the study of 
the computational complexity of CSPs for structures that are first-order definable in 
£, as illustrated in the following examples. 

Example 8.4.1. We can use an algorithm for Expr-pp(£) to decide whether all 
polymorphisms of a structure (C; Ri, . . . , R n ), given by r-formulas (pi, . . . , <p n that de- 
fine Ri, . . . , R n over £, are essentially unary. For that, we simply apply the algorithm 
to x 7^ y V y 7^ z, (pi, ... , (p n , for each i; here we use Proposition |5.3.3| 

Example 8.4.2. To decide whether a structure (C; Ri, . . . , R n ), again given by 
T-formulas <pi , . . . , <p n that define Ri , . . . , R n over £, is a core, we apply the algorithm 
for Expr-pp(£) to ->(f>i, (pi, . . . , (p n , for each i. Additionally, we apply the algorithm to 
x 7^ y,(pi, ■ . ■ , (p n - The structure (C; Ri, . . . , R n ) is a core if and only if none of those 
calls reports false, that is, all the relations defined by -nfo or by x ^ y are primitive 
positive definable in [C;Ri, . . . , R n )- 

Example 8.4.3. We can use an algorithm for Expr-pp(C) to effectively test the 



hardness condition for CSP(03) given in Proposition 5.5.10 for structures 03 with a 



first-order definition in £ and a finite relational signature. 

The main result of this section is the decidability of Expr-pp(C) for a certain class 
of structures £. Even for the simplest of countable structures, namely the structure 
(X;—) having no relations but equality, the decidability of Expr-pp(T) is not obvious 
(see [28| ). Recall the concept of finitely bounded structures £ (Definition |3.2.7 ): we 



require that the age of £ is given by a finite set of finite forbidden induced substruc- 
tures. 

Theorem 8.4.4 (from 49 ). Let £ be of finite relational signature, and first- order 
definable over a structure 2) which is homogeneous, ordered, Ramsey, finitely bounded, 
and with finite relational signature. Then Expr-pp(C) is decidable. 

Proof. Let D be the domain of D and £. Suppose we are given the formulas 
(pQ, (pi, . . . , 4>k that define the relations R , Ri, ■ ■ ■ , Rk over £. Set 03 to be the struc- 
ture (D; Ri, . . . , R k ); note that every automorphism of 03 is also an automorphism 
of £ and 25. We will decide whether there is a primitive positive definition of Rq in 
03 . We can without loss of generality assume that in each formula 4>q , . . . , <pk , the 
variables are called xi, . . . , x v , for some p. 

By Theorem |8.3.17[ if R is m-ary and does not have a primitive positive def- 
inition in 03, then there exists a finite d, a d-&ry polymorphism / of 03, and m- 
tuples t 1 , . . . , t d S R such that f{t l , . . . , t d ) ^ R , and / is canonical as a map from 
(SM 1 ) • • • ES (D, t d ) to D. Such a polymorphism of 03 will be called a witness at 
t 1 , . . . , t d (for the fact that i?o is not primitive positive definable in 03). The question 
whether such a witness exists for a specific choice of tuples t 1 , . . . ,t d does of course 
only depend on the orbits of t 1 , . . . , t d in £>, and by w-categoricity of 2) there are only 
finitely many such orbits. Moreover, by homogeneity of J), the orbits of n-tuples are 
in one-to-one correspondence to the n-element induced substructures of 2), which can 
be effectively stored and enumerated on a computer. So it suffices in the following 
to consider the case where t 1 , . . . ,t d are fixed, and to show how to decide whether a 
witness exists at this choice of t 1 , . . . ,t d . 

Since expansions of homogeneous structures by finitely many constants are homo- 
geneous, the one-to-one correspondence between orbits of Z-tuples, maximal Z-types, 
and induced Z-element substructures of D extends to the structures (J),t l ). In the 
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following, let n be max(3, n' + 1) where n! is the maximal arity of the relations in 2). 
Then by homogeneity of 2) the behavior of / is determined by the n-type conditions 
that are satisfied by / (for this property we only need that n > n'; the requirement 
n > 3 is motivated by the way how we treat equality in our approach, as we will see 
later). When / is canonical, then the set A of n-type conditions can be viewed as 

a function from Sn ' x • • • x Si ' to S„ . By w-categoricity of (T),t l ) and of T> 
there are only finitely many such functions A. 

We decide the existence of a witness by reduction to a finite-domain constraint 
satisfaction problem. The domain of the CSP is the set of all n- types of 2). The 
instance of the CSP has a different variable for every d-tuple (Si, . . . , S d ) where Si 
is an n-type of (2),i 4 ); in fact, we identify the variables of the instance and those d- 
tuples of n-types. The constraints are described below. The idea is that the solutions 
to this CSP are exactly the functions A for witnesses as described above. 

To implement this in detail, it will be convenient to make the assumption that 
Af is minimal in the sense that it does not contain structures 2li,2l2 such that 21 1 
is an induced substructure of 2I2; this assumption is without loss of generality since 
otherwise we remove 2I2 from Af, and find that the resulting set of structures still 
bounds D. 

• (Compatibility.) Note that every behavior of a witness must have an ex- 
tension to a function from Sp 3 '' ' x •■• x S^' 1 ^ to Sf, for all 1 < I < n. 

Hence, when (Si, . . . , S d ), (Ti, . . . ,T d ) e St?'* 1 * x ■ ■ • x si V ^\ and I C [n], 
and if for all i < d the subtype of Si induced by / and the subtype of Tj 
induced by / coincide, then we impose the binary constraint that / induces 
the same subtype in A(Si, . . . , S d ) £ S® and in A(T X , . . . ,T d ) € S®. 

• (Realizability.) We also want to make sure that the behavior A can be 



realized by an operation (recall the example given in Section 8.3.3 1. The idea 
is that when 2) is finitely bounded, then A should not force the existence of 
one of the forbidden substructures in the image, since in this case it would 
be impossible to find an operation with image in 2) whose behavior is A. As 
we will see, it suffices here to consider structures 21 G M whose number of 
elements s exceeds n. 

For each structure 21 € Af with s > n elements and each sequence 
Si,...,S d with Si G Si s;t,) for alH < d we have a constraint of arity 
r := (*). Let ai,...,a s be the elements of 21. Observe that for every 
subset / C [s] with |/| = n the structure induced by {a^ | i € 1} in 21 is an 
induced substructure of 2), by the minimality assumption on M . Let 
be the formula with variables X\, . . . ,x n that contains for i±, . . . ,i m G / 
the conjunct R(xi 1 , . . . , Xj m ) if and only if (a.^ , . . . , a,i m ) £ By the 

observation we just made, ^[Z] is contained in a unique n-type of 25. 

The constraint of arity r requires that for some / C [s] with \I\ = n the 
subtype of A(Si, . . . , S d ) induced by I does not contain 
(Violation.) We want that A is the behavior of an operation that violates the 
m-ary relation Rq. For simplicity of presentation, we assume that m > n; 
this is without loss of generality, since we can otherwise add dummy variables 
to 0o • 

For t = (ii, . . . , t m ) and ix,...,i n € [to] with «!<•••< i n , denote by 
t[{ii, . . . , i n }} the tuple (t il , . . . , t in ). When I C [m] we denote by (j>o[I] the 
subtype of {0o} induced by /; here, {4>o} is viewed as a type over the empty 
theory. We add the ( m )-ary constraint that for some I C [to] of cardinality 
n the type A^p^'' 1 ) (t l [I\), . . . , tp^* 1 ) (?[!])) does not contain <f> [I]. 
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• (Preservation.) We also want that A is the behavior of an operation that 
preserves 23. Let j < k, and suppose that the relation Rj of 03 defined by <j>j 
is p-ary. For simplicity of presentation, we assume that p > n, otherwise we 
add dummy arguments to Rj. For every list Si, ■ . . , Sd such that Si € Sp®' 1 ' 
contains 4>j for all i < d, we impose the following constraint of arity q — ( p j . 
For all I C [p] with |/| = n, let S( be the subtype of Si induced by /, and 
let Sq be the subtype of {<fij} (of the empty theory) induced by /. We add 
the constraint that A(S[, . . . , S^) contains Sq for all JC [p]. 

We now prove that there is a witness / at t 1 , . . . , t d for the fact that Rq is not 
primitive positive definable in 03 if and only if the described CSP instance has a 
satisfying assignment, which concludes the proof. For the easy direction, suppose 
that there exists such a witness, and let A be its behavior. Then A clearly satisfies 
compatibility, rcalizability, violation, and preservation constraints. 

For the opposite direction, suppose that a is a solution to the described CSP, i.e., 
a mapping from S^' 1 x • • • x S^' 1 ^ to 5® that satisfies compatibility, realizability, 
violation and preservation constraints. We show the existence of a witness / at 
t 1 , . . . , t d in three steps. 

We first construct an infinite structure € with domain D d of the same signature r 

as 33 as follows. When a 1 , . . . , a d € D n are such that a % £ S^'* \ then for R 6 a the 
relation R((a\, . . . , af), . . . , (a\, . . . , a d )) holds in l£ if and only if R{x\, . . . , x n ) is con- 
tained in A( tp^ 25 '* ' (a 1 ) x ■ • • x tp^' ) (a d )) . This is well-defined by the compatibility 
constraints. 

Next, we consider the relation ~ on the domain of € such that a\ ~ ai for 
ax,G2 € D d if and only if there exist a^,...,a n € D d such that the subtype of 
A((tp( s '* 1 )((a}, . . .,fl£)) x • ■ ■ x tp( 33 '*' i )(af, . . . ,a£)) induced by {1,2} contains a?i = 
X2- Note that since n > 3, the relation ~ must be an equivalence relation (since the 
properties of an equivalence relation can be formulated with a universal formula with 
three variables). Then the quotient structure <E/^ is defined to be the r-structure 
whose elements are the equivalence classes E/ ^ of ~, and where R(Ei, . . . , E p ) holds 
for a p-axy R € r and Ei, . . . , E p 6 E / ^ if and only if there are 6i € Ei, . . . , b p € E p 
such that R(bi, . . . , b p ) holds in £. 

The final step is to show that there exists an embedding / of <E/~ into 55. By 
w-categoricity of 33 and Lemma |3.1.5| it suffices to show every finite substructure 21 of 
(£/^ embeds into 2). Since D is finitely bounded by AT, we thus have to show that no 
structure in M embeds into 21. Suppose to the contrary that there is an embedding 
e of J € TV into 21. Let ui , . . . , u s be the elements of Pick any elements Vi , . . . , v s 
from the equivalence classes e(ui), . . . , e(u s ), respectively. The rest of the paragraph 
is devoted to the argument that the mapping e' that maps itj to is an embedding 
of 5" into l£, which contradicts the realizability constraints. It is obvious that e' is 
injective. To see that it is a strong homomorphism, let R be an (n — l)-ary symbol 
from r; the case that R has a smaller arity can be dealt with by adding dummy 
variables. Note that in the following we use the assumption that n is strictly larger 
than the maximal arity of D; intuitively, we implement Leibniz' law for equality. We 
have R(e(ui 1 ), . . . , e(u ip )) if and only if there are v' ix € e(ui ± ), . . . , v\ € e(u ip ) so that 
Riv'^ , . . . , v[ v ) holds in £. Since the formula R(xi, . . . , x p ) A x p — x p+ i is contained 
in 

A(tp( s ^)(<[l],...,uUl],«Jl]),...,tp( s ' td )(<[d]),...,^[d],^„[d])) , 

this type of S3 must also contain R{xi, . . . , x p —i, x p +i). In this way we argue suc- 
cessively for all arguments of R, and finally obtain that R(vi 1} ...,Vi ) holds in €. 
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The argument can be reverted, and we have that e' is a strong homomorphism. We 
conclude that e' is an embedding. 

Observe that the mapping g from D d to D that maps u to f(u/^,) is a polymor- 
phism of 05, by the preservation constraints, and it is canonical by construction. By 
the violation constraints, g violates i?o, and hence is the desired witness. □ 

Analogously to the proof of this theorem, one can show the following. 

Theorem 8.4.5 (from [49]). Let £ be with finite relational signature, and first- 
order definable over a structure 2) which is ordered, homogeneous, Ramsey, finitely 
bounded, and with finite relational signature. Then Expr-ex(£) and Expr-ep(£) are 
decidable. 

An important open problem is whether the method can be extended to show 



decidability of Expr-fo(£), under the same assumptions on £ as in Theorems 8.4.4 
and |8.4.5| By the theorem of Ryll-Nardzewski, first-order definability is character- 
ized by preservation under automorphisms, i.e., surjective self-embeddings. But the 
requirement of surjectivity is difficult to deal with in our approach. 

Question 8.1. Let 95 be with finite relational signature, and definable in a struc- 
ture £ which is ordered, homogeneous, Ramsey, finitely bounded, and with finite rela- 
tional signature. Is Expr-fo(Q5) decidable? 

While the conditions of Theorem |8.4.4| might appear rather restrictive at first 
sight, they actually are quite general: we want to point out that we only require that 
£ is first-order definable over an ordered, homogeneous, Ramsey, and finitely bounded 
structure, rather than requiring that £ itself to have these properties. We do not know 
of a single homogeneous structure £ with finite relational signature which does not 
satisfy the conditions of Theorems |8.4.4| and |8.4.5| Examples of structures £ that do 
satisfy the assumptions, and the corresponding references, see Section |8.1| 



We finally show that the assumption in Theorem 8.4.4 of £ being finitely bounded 
is necessary. 

Proposition 8.4.6. There exists a homogeneous ordered Ramsey structure £ 
with finite relational signature such that Expr-fo(£), Expr-pp(£) 7 Expr-ep(£), and 
Expr-ex(£) is undecidable. 



Proof. Recall the definition of Henson digraphs from Example 3.2.6 When C 
is the age of a Henson digraph £', then the class C consisting of all structures obtained 
from the digraphs in C by adding an arbitrary linear order on the vertices, is again 
an amalgamation class. In fact, it is even a Ramsey class by the results described in 
Example |8.1.9| Let £ denote its Fraisse-limit. 

We first show that non-isomorphic Henson digraphs £i and £2 have distinct 
Expr-pp problems. In fact, we show the existence of a first-order formula 4>± over di- 
graphs such that the input with </> := E(x, y) and </>i is a yes-instance of Expr-pp(£i) 
and a no-instance of Expr-pp(£ 2 ), or vice versa. Since there are uncountably many 
Henson digraphs, but only countably many algorithms, this clearly shows the existence 
of Henson digraphs £' such that Expr-pp(£') is undecidable. It follows that for the 
ordered Ramsey structure £ described above the problem Expr-pp(£) is undecidable 
as well. 

Since €\ and £2 are non-isomorphic, there must be a structure 21 that embeds to 
£1 but not to £ 2 , or that embeds to £ 2 but not to £1. Assume the former is the case; 
in the latter, simply exchange £1 and £ 2 . Let s be the number of elements of 21, and 
denote the elements by ai, . . . , a s . Let tp be the formula with variables x%, . . . , x s that 
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has for distinct i,j < s & conjunct E(xi,Xj) if E{a il aj) holds in 21, and a conjunct 
—>E(xi, Xj) A i, ^ Xj otherwise. Let <f> be the formula ip E(x s +i, £ s +2)- 

Let Ci be the domain of £i, and consider the relation Ri C (Ci) s+2 defined by 
(/> in Ci. Let R be a relation symbol of arity s + 2, and 25 be the structures with 
signature {R}, domain C\, and where R denotes the relation R\. It is clear that 
3xi, . . . , x s . R(xi, . . . , x s , x, y) is a primitive positive definition of E(x, y) in 23. 

Now consider the relation R 2 defined by <f) m C 2 over the domain C 2 . Since 21 
does not embed into £2, the precondition of </> is never satisfied, and the relation R2 
is empty. Hence, the structure (C2;i?2) is preserved by all permutations. But the 
relation E(x, y) is certainly not first-order definable over a structure that is preserved 
by all permutations. □ 



CHAPTER 9 



Schaefer's Theorem for Graphs 




Jaroslav Nesetfil, 2004 



This chapter is based on results from 46 48 186 



9.1. Motivation and the Result 



In an influential paper in 1978, Schaefer 175 classified the computational com- 



plexity of CSP(Q3) for all structures 25 over a two-element universe; the result and a 
simple proof have been presented in Section [5.4| Schaefer's theorem can be viewed as 
a complexity classification for systematic syntactic restrictions of the Boolean satisfi- 
ability problem, as reflected in the following formulation of the result. 

Let \& = {t/ji, ... , ip n } be a finite set of prepositional (Boolean) formulas. 

Boolean-SAT^) 

INSTANCE: Given a finite set of variables W and a propositional formula of the form 
$ = fa A • • • A fa where each fa for 1 < i < I is obtained from one of the formulas ip 
in \E' by substituting the variables of tp by variables from W. 

QUESTION: Is there a satisfying Boolean assignment to the variables of W (equiva- 
lently, those of $)? 



Schaefer's theorem (Theorem 5.4.31 states that Boolean-SAT^) can be solved 
in polynomial time if and only if ^ is a subset of one of six sets of Boolean formulas 
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(called 0-valid, 1-valid, Horn, dual-Horn, affine, and bijunctive) , and is NP-complete 
otherwise. 

We prove a similar classification result, but for the first-order logic of graphs 
instead for propositional logic. More precisely, let E be a relation symbol which 
denotes an antireflexive and symmetric binary relation and hence stands for the edge 
relation of a (simple, undirected) graph. We consider formulas that are constructed 
from atomic formulas of the form E(x, y) and x = y by the usual boolean connectives 
(negation, conjunction, disjunction), and call formulas of this form graph formulas. 
A graph formula $(a;i, . . . , x m ) is satisfiable if there exists a graph H and an m-tuple 
a of elements in H such that $(a) holds in H. 

The problem to decide whether a given graph formula is satisfiable can be very 
difficult. For example the question whether the Ramsey number i?(5, 5) is larger 
than 43 (which is an open question, see e.g. [88] ) can be easily formulated in terms 
of satisfiability of a single graph formula. Recall that i?(5, 5) is the least number 
k such that every graph with at least k vertices either contains a clique of size 5 
or an independent set of size 5. So the question whether R(5, 5) is greater than 43 
can be formulated using 43 variables x±, . . . , X43 by imposing the constraints that all 
variables denote a different vertex in the graph, and by imposing for every subset of 
the variables of cardinality five a constraint that forbids that the corresponding five 
variables form a clique or an independent set. If there is a solution to this instance, 
then this implies that R(5, 5) < 43, and otherwise i?(5, 5) > 43. 

Similarly as in Schaefer's theorem, we systematically investigate restrictions of 
the satisfiability problem for graph formulas that can be solved in polynomial time. 
Let ^ = {ipi, . . . ,ip n } be a finite set of graph formulas. Then iff gives rise to the 
following computational problem. 

Graph-SAT(^) 

INSTANCE: Given a set of variables W and a graph formula of the form $ = 
fa A ■ • • A fa where each fa for 1 < i < I is obtained from one of the formulas ip 
in \£ by substituting the variables from ip by variables from W. 
QUESTION: Is $ satisfiable? 

Example 9.1.1. Let iff be the set that just contains the formula 

(E(x,y)A^E(y,z)A^E(x,z)) 

V ( r E{x,y)AE(3f,z)A^E(x,z)) 

V (-^E(x,y)A^E(y,z)AE(x,z)) . 

Then Graph-SAT^) is the problem of deciding whether there exists a graph such 
that certain prescribed subsets of its vertex set of cardinality at most three induce 
subgraphs with exactly one edge. The problem Graph-SAT( , 5) is NP-complete. 

Example 9.1.2. There are also many interesting tractable Graph-SAT problems, 
for instance when 'J consists of the formula 

(E(x,y)A^E(y,z)A^E(x,z)) 

V (^E(x, y) A E(y, z) A -iE(x, z)) 

V (^E(x, y) A ->E(y, z) A E(x, z)) 

V (E(x, y) A E(y, z) A E(x, z)) . 

It is obvious that the problem Graph-SAT('I') is for all $ contained in NP. The 
goal of this chapter is to prove the following dichotomy result. 
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Theorem 9.1.3. For all the problem Graph-SAT('J') is NP- complete or in P. 
Moreover, the problem to decide for given ^ whether Graph-SAT('I') is NP-complete 
or in P is decidable. 

We establish our result by translating Graph-SAT problems into CSPs. More 
specifically, for every set of formulas '3/ we present a relational structure 58* such that 
Graph-SAT('I') is equivalent to CSP(58*). The relational structure 58* has a first- 
order definition in the random graph (V; E) introduced in Chapter|3] This perspective 
allows us to use polymorphisms to classify the computational complexity of Graph- 
SAT problems as outlined in Chapter [5j Our proof also relies crucially on strong 
results from structural Ramsey theory. Following the technique from Chapter |8j we 
use such results to find regular patterns in the behavior of polymorphisms of structures 
on (V; E), which in turn allows us to find analogies with polymorphisms of structures 
on Boolean domains. Our dichotomy result can be stated as follows. 

Theorem 9.1.4. Let 58 be a relational structure with a first-order definition in 
(Y;E). Then exactly one of the following two statements applies. 

(1) there is a primitive positive interpretation of ({0, 1}; 1IN3) in 58. In this 
case, 58 has a finite- signature reduct with an NP-hard CSP, by Corollary \5.5.8\ 

(2) 58 has a 4~ary polymorphism f and automorphisms a\,cti such that 

f(y, V, x, x) = ctxf{x, x, x, y) = a 2 f(y, x, y, x) 

for all x,y G V. In this case, every finite- signature reduct of 58 has a 
polynomial-time tractable CSP. 



The proof of this theorem can be found at the end of Section 9.5 The algorith 



mic part of Theorem |9.1.4| is obtained by combinations of infinite domain constraint 



satisfaction techniques (such as presented in Section 6.2 1, and by reductions to the 



tractable cases of Schaefer's theorem (Theorem 5.4.31. 

In the remainder of this chapter, 58 denotes a relational structure with a first- 
order definition in the random graph (V; E) . Since all the polymorphism clones of 
this chapter contain the automorphism group Aut((V;i?)) of the random graph, we 
also make the following convention, which exclusively holds for this chapter. For a 
set of functions F and a function g on the domain V, we say that F generates g 
when F U Aut((V; E)) locally generates g; also, we say that a function / generates g 
if {/} generates g. That is, in this paper we consider the automorphisms of (V; E) 
be present in all sets of functions when speaking about the local generating process. 

9.2. Endomorphisms 

The goal of this section is the proof of Proposition |9.2.3| which allows us to 
reduce the classification task for 58 to the classification of those structures 58 where 
the relations E and N :— {(u, v) € V 2 | ~^E(u,v) A u ^ v} are primitive positive 
definable. 

Definition 9.2.1. For all k > 3, let P( fe ) denote the k-ary relation that holds on 
X\, . . . , Xk £ V if X\, . . . , Xfc are pairwise distinct, and the graph induced by {x\, . . . , Xk} 
in (V; E) is neither an independent set nor a clique. 

Definition 9.2.2. Let T be the 4-ary relation that holds on x\, x%, 2:3, £4 € V if 
Xi,X2,X3,Xi are pairwise distinct, and induce in (W:E) either 

• a single edge and two isolated vertices, 

• a path with two edges and an isolated vertex, 

• a path with three edges, or 

• a complement of one of the structures stated above. 
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Proposition 9.2.3. Let 58 be first-order definable in (V;E). Then at least one 
of the following holds. 

(a) 58 has a constant endomorphism. In this case CSP(58) is trivial. 

(b) 58 is homomorphically equivalent to a countably infinite structure that is pre- 
served by all permutations of its domain. Such structures have been classified 
in Chapter^ 

(c) There is a primitive positive definition of P 1 - 3 ^ or T in 58. In this case 
CSP(58) is NP-complete. 

(d) The relations N , E, and ^ have primitive positive definitions in 58. 

To prove the proposition, we first cite a result about the reducts of the random 
graph due to Thomas |187| ; the formulation of the result presented here first appeared 



46 . The graph (V; E) contains all countable graphs as induced subgraphs. In 
particular, it contains an infinite complete subgraph, denoted by K u . It is clear that 
any two injective operations from V — > V whose images induce K u in (V; E) generate 
one another. Let be one such operation. Similarly, (V; E) contains an infinite 
independent set, denoted by 1^. Let e^v be an injective operation from V — > V whose 
image induces J w in (V; E) . 

Theorem 9.2.4. Let 58 be first-order definable in (Y;E). Then one of the fol- 
lowing cases applies. 

(1) 58 has a constant endomorphism. 

(2) 58 has the endomorphism e^. 

(3) 58 has the endomorphism ejv- 

(4) The endomorphisms of 58 are locally generated by the automorphisms of 58 . 



Corollary 9.2.5 (from [46] ) . All relational structures 58 with a first- order def- 
inition in (V; E) are model- complete. 

Proof. By Theorem |3~6Jl an cj-categorical structure 58 is model-complete if 
and only if Aut(58) is dense in the monoid ^# of self-embeddings of 58. We apply 



Theorem 9.2.4 to ^£ ', which, as a closed monoid containing Aut((V;i?)), is also an 
endomorphism monoid of a structure 58' with a first-order definition in (V; E). Clearly, 
58' and 58 have the same automorphisms, namely those permutations in ^ whose 



inverse is also in Therefore we are done if the last case of Theorem 19.2.41 holds. 
Note that ^# cannot contain a constant operation as all its operations are injective. 
So suppose that ./# contains e^ - the argument for is analogous. Let R be any 
relation of 58, and 4>r be its defining quantifier- free formula; (j)R exists since (V; E) has 
quantifier-elimination. Let tpR be the formula obtained by replacing all occurrences 
of E by false; so is a formula over the empty language. Then a tuple a satisfies 
4>r in (V; E) iff ejv (a) satisfies <j) R in (V; E) (because ejy is an embedding) if and 
only if ej\r(a) satisfies ipn in (V;i?) (as there are no edges on ejv(a)) if and only if 
ejv(ct) satisfies ipR in the substructure induced by ejv[V] (since tpn does not contain 
any quantifiers). Thus, 58 is isomorphic to the structure on eAr[V] which has the 
relations defined by the formulas ipR. Therefore, 58 is isomorphic to a structure with 
a first-order definition over the empty signature. This structure has, of course, all 
injections as self-embeddings, and all permutations as automorphisms, and hence is 
model-complete; the same is true for 58. □ 

The last case in Theorem |9 . 2 ,4| splits into five sub-cases, corresponding to the five 
locally closed permutation groups that contain Aut((V; E)) exhibited by Thomas 1 186 . 
Knowledge about these groups will be important for the complexity classification, and 
we will next cite the theorem that lists them. 
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If wc flip edges and non-edges of G, then the resulting graph is isomorphic to G (it 
is straightforward to verify the extension property). Let — be such an isomorphism. 
For any finite subset S of V, if we flip edges and non-edges between S and V \ S 
in (Y;E), then the resulting graph is isomorphic to (Y;E) (it is straightforward to 
verify the extension property). For any non-empty set S, we write is for such an 
isomorphism. Note that when S and T are two finite non-empty subsets of V, then 
is and generate one another. We also write sw for i{ }, where G V is any fixed 
element of V. 

Let be the fc-ary relation that holds on x\,...,xu G V if Xi,...,Xk are 

pairwise distinct, and the number of edges between these k vertices is odd. Note that 
i?( 4 ) is preserved by — , i?' 3 ' is preserved by sw, and that R^ is preserved by — and 
by sw, but not by all permutations of V. We can now describe the structures with a 
first-order definition in the random graph up to first-order interdefinability. 

Theorem 9.2.6 (of [186] ). Let 58 be a relational structure with a first-order 
definition in (Y; E) . Then exactly one of the following is true. 

(1) 58 is first- order interdefinable with (Y;E); equivalently, Aut(58) = Aut((V;i?)). 

(2) 58 is first-order interdefinable with (V; R^); equivalently, 58 is preserved by 

— , but not by sw. 

(3) 58 is first- order interdefinable with (Y;R^); equivalently, 58 is preserved by 
sw, but not by — . 

(4) 58 is first- order interdefinable with (Y;R^); equivalently, 58 is preserved by 

— and by sw, but not by all permutations ofY. 

(5) 58 is first-order interdefinable with (V;=); equivalently, 58 is preserved by 
all permutations ofY. 

In particular, the reducts (Y;P^) and (V; T) are both first-order interdefinable 
with one of the five structures of this theorem; we will now show with which one, and 
prove hardness for both reducts. We start with (V; P^ 3 )). 

PROPOSITION 9.2.7. The structures (Y;P^) and (Y;R^) are first- order inter- 
definable. 

Proof. First observe that the relation P^ is not preserved by sw: if xi,X2 G V 
are so that E(x\,X2) and N(0,Xi) hold for i = 1,2, then (0, £1,212) G P^ 3 \ but 
{swj(0), sw(xi), sw(x 2 )} is a clique in (Y;E), so (sw(0), sw(xi), s w(x 2 )) P^ 3 K On 



the other hand, P' 3 ' is clearly preserved by — . Hence, Theorem 9.2.6 implies that 



(V; P (3) ) is first-order interdefinable with (V; R^). □ 

For k > 3, let QW be the fc-ary relation that holds for a tuple (xi, . . . ,x^) G 
V fe iff Xi,...,Xk are pairwise distinct, and (x%, . . . ,x^) ^ P( k \ In other words, 
(x\, . . . , Xk) G if and only if x\, . . . ,Xk induce in (V; E) either a clique or an 

independent set of size k. 

Lemma 9.2.8. There is a primitive positive definition of in (Y;P^). 

Proof. Observe first that =1 is primitive positive definable from P^ by the 
formula 3u. P^ 3 \x, y, u). Let 8 be the primitive positive formula that states about 
the tuple (xi, . . . ,xq) that all its entries are distinct. Let be a conjunction of 
atomic formulas with variables from x\,...,x% that contains for each three-element 
subset {u, v,w} of those variables a conjunct P^(u,v,w). It is known that every 
two-coloring of the edges of the graph K 6 (the clique with six vertices) contains a 
monochromatic triangle. Therefore, <p A 5 is unsatisfiable. Let <// be a conjunction 
over a maximal subset of the conjuncts of </> with the property that <j)' A 5 is still 
satisfiable, and suppose without loss of generality that the conjunct P*- 3 -* (x\, x 2 , X3) 
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Figure 9.1. Illustration for the relation T. 



of (j) is missing in <j)' . We claim that ip := 3xa,x$,xq {ft A 5) defines Q^ 3 \ To see 
this, suppose that fisa triple that satisfies ip; that means that there exists a 6-tuple 
s extending t which satisfies ft A 5. From the maximality of ft, we infer that s, and 
hence also t, does not satisfy P^(xi,x 2 ,x 3 ), so t e Q' 3 '. Therefore, V defines a 
subset of Q( 3 \ Observe also that ip does not define the empty set since ft A S, and 
hence also ip, is satisfiable. Let t be any tuple satisfying if). Since P*- 3 -* is preserved 
by — , one can check that consists of only one orbit in (V;P ( - 3 - > ). Thus, for any 
tuple w in Q^> there exists an automorphism of (Y;P^) that sends t to w. This 
automorphism clearly preserves if), and hence w satisfies if), proving the claim. Next 
observe that Q^ 3 \x, y, u) A Q^ 3 '(y, u, v) A Q^(x : y, v) is equivalent to Q^(x, y, u, v), 
and hence has a primitive positive definition from p( 3 K □ 

Proposition 9.2.9. There is a primitive positive interpretation of ({0, 1}; NAE) 
m (V;P^), and CSP((V; P^)) is NP-complete. 

Proof. We give a 2-dimensional primitive positive interpretation / of ({0, 1}; NAE). 
The domain formula 5j(xi, #2, J/a) is 

3z 1 ,z 2 (Q { - A) {xi,x2,z 1 ,z 2 ) A Q {i) (z 1 ,z 2 ,y 1 ,y 2 )) ■ 
The formula NAEj (xi ,x 2 ,yi,y 2 , zi, z 2 ) is 

A Q (4) (yi,y 2 ,f,w) A Q (4) (zi,z 2 , . 
These two formulas are equivalent to primitive positive formulas over (V;P( 3 ') by 



Lemma 9.2.8 The coordinate map sends a tuple (xi,x 2 ) to 1 if E(x\,x 2 ) and to 



otherwise. The second part of the statement now follows from Corollary |5. 5. 8| □ 

We now turn to the structure (V; T) . For an illustration of the relation T, see 
Figure |9.1[ The first row shows all graphs, up to isomorphism, that are induced by 
4-tuples from the relation T (note that T is totally symmetric, i.e., whenever we have 
a tuple in T and we permute its components, we obtain again a tuple in T). The 
other two rows show the graphs induced by 4-tuples of distinct elements of V that are 
not from T, again up to isomorphism. We denote the four vertices of those graphs by 
1, 2, 3, and 4, as shown in the picture. 



PROPOSITION 9.2.10. The structure (V; T) is first-order interdefinable with (V; R^). 
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Proof. The relation T is obviously not preserved by all permutations of V, but 
preserved by — . It can be checked that T is preserved by sw. Theorem |9 . 2 . 6| implies 
that it is first-order interdefinable with (V; R^ ) . □ 

Lemma 9.2.11. Suppose that sw € Aut(*B). Then T consists of one orbit of 
4-tuples with respect to 03. 

Proof. For a graph on {1,2,3,4} and a vertex v € {1,2,3,4}, we say in the 
following that we flip the graph at v when we produce a new graph by nipping the 
edges and non-edges that are adjacent to v. Now consider the graphs of the first 
row of Figure |9.1[ which are those corresponding to tuples in T. If we flip the first 
one at 2, we obtain the second one; flipping the second graph at 3 yields the third 
one; and flipping the third graph at 4 yields the fourth one. Finally, flipping the 
first graph at 4 yields a graph isomorphic to the fifth graph. Since this flipping 
operation corresponds to applications of sw (more precisely: switches with respect to 
an appropriately chosen point in V) to tuples in T, and isomorphism of these 4- vertex 
graphs to applications of appropriately chosen automorphisms of (V; E) , we get that 
indeed any tuple in T can be transformed into any other tuple in T by applications 
of sw and automorphisms of (V; E) . □ 

We need the following auxiliary lemma before proving hardness for CSP((V;T)). 

Lemma 9.2.12. The relation T' consisting of all 4-tuples with distinct entries in 
V that do not belong to T is primitive positive definable in (V;T). 

Proof. Observe first that 7^ is primitive positive definable from T by the formula 
Bu, v. T(x, y, u, v). Let S be the primitive positive formula that states about the tuple 
{x\, . . . , Xxs) that all its entries are distinct. 

It is a well-known fact that every two-coloring of the edges of the graph Ki$ con- 
tains a monochromatic clique of size 4. Let <j> be the conjunction over all atomic for- 
mulas of the form T(a, 6, c, d) where {a, b, c, d} is a 4-element subset of {xi, . . . , xis}. 
By the above fact, <p A 5 is unsatisfiable. 

Let c/>o denote the conjunction over a maximal subset of the conjuncts of cp with 
the property that (po A 5 is satisfiable, and let T(y\, . . . , 7/4) denote a conjunct from <p 
that is not a conjunct of <po- 

We claim that T" is defined by the formula ip obtained from (po A 5 by existentially 
quantifying all variables except for y\, . . . , 7/4. To see this, suppose that t is a 4-tuple 
that satisfies ip; this means that there exists a 18-tuplc s extending t which satisfies 
4>o A 8. From the maximality of <pQ, we infer that s, and hence also t, does not satisfy 
T(yi, . . . ,2/4), so t <E T' . Therefore, ijj defines a subset of T' . Observe also that ip 
does not define the empty set since 4>a A 8, and hence also ip, is satisfiable. Let t be 



any tuple satisfying ip. As in Lemma 9.2.11 one can check that T' consists of only 



one orbit in (V; T) . Hence, for any tuple w in T there exists an automorphism of T 
that sends t to w. This automorphism clearly preserves ip, and hence w satisfies ip, 
proving the claim. □ 

Let R be the relation {(t x , t 2 , t 3 , t 4 ) € {0, 1} | t\ + t 2 + t 3 + t 4 = 2}. 

Proposition 9.2.13. There is a primitive positive interpretation of ({0, 1}; R) in 
(V;T) ; and CSP((V;T)) is NP -complete. 

Proof. Before we give the interpretation, we introduce a ('link'-) relation LCV 6 
that is of central importance in this proof. For a finite subset S of V, write #S for 
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the parity of edges between members of S. Now define 

L := | x £ V 6 | the entries of x are pairwise distinct, and 

#{xi,X 2 ,X 3 } = #{x 4 ,X 5 ,X 6 }} . 

Let </>(x, y, z, u, v, w) be the conjunction over all atomic formulas of the form T'(a, b, c, d) 
for every four-element subset {a, b, c, d} of {x, y, z, u, v, w}. We claim that 

ip : = 3yi, J/2, 2/3 (0(^1, a;2,a;3, 2/1,2/2,2/3) A <p(y 1 ,y 2 ,y 3 ,x 4 ,x 5 ,x 6 )) 

defines L(xi, . . . , x 6 ). 

Observe first that L is preserved by sw and — . Moreover, since T' is preserved by 
sw and — , so is ip. We can therefore take the liberty of applying sw and — to tuples 
when showing the equivalence of ip and L. 

Let t £ V 6 be a tuple with pairwise distinct entries. We first show that when t £ L 
then it also satisfies ip. Applying sw and — to t repeatedly, we can assume without loss 
of generality that N{t 1 ,t 2 ) A N(t 2 ,t 3 ) A N(t 1: t 3 ) and N(t 4 , t 5 ) A N(t 5 , t e ) A N{t 4 , t 6 ). 
Choose values si,s 2 ,s 3 £ V for the variables 2/1,2/2,2/3 such that between any two 
distinct vertices x, y from {ti, . . . ,te, s\, s 2 , s 3 } we have N(x,y). This satisfies all 
conjuncts in ip. 

For the converse, we suppose that t satisfies ip. Let Si,s 2 ,s 3 £ V be witnesses 
for 2/1,2/2,2/3 that show that ip holds for t. Assume for contradiction that t (£ L, i.e., 
4t z {x\,x 2 ,x 3 \ 7^ #{2:4, X5, xq}. Applying sw and — , we can assume without loss of 
generality that N(h,t 2 ) A N(t 2 ,t 3 ) A iV(*i,t 3 ) and E(U,t 5 ) A E(t 5 ,t 6 ) A E(t 4 ,t 6 ). 
Moreover, by applying sw and — again, we can assume that {si,S2,S3} induces a 
clique (or an independent set, which would be the same for our further argument). 
Because of T' (ti,t 2 ,t 3 , si), either the relation E holds between all pairs {ti,S]_}, 
{^2,si}, and {t 3l s\}, or the relation N holds between all these pairs. Similarly, we 
can observe the same for s 2 and S3, so one of the two possibilities occurs twice. So 
assume wlog. that E holds for any pair p, q with p £ {ti, t 2 , t 3 } and q £ {s\, s 2 }. But 
then T'(ti, t 2 , si, s 2 ) does not hold, a contradiction. 

Our primitive positive interpretation / of ({0, 1}; R) has dimension three. The 
domain formula <$/(xi, X2, X3) of our interpretation is x\ ^ x 2 A xi ^ x 3 A x 2 ^ x 3 . 
The formula =/ is L(xi,x 2 ,x 3 ,yi,y 2 ,y 3 ). The formula Ri(x±, x\, x\, . . . , xf, x|, x|) 
is 

32/1,2/2,2/3,2/4 T(y 1 ,...,y 4 ) 

A L(x{,xl,xl,y 2 ,y 3 ,y 4 ) 
A L(x 2 1 ,xl,xj,y 1 ,y 3 ,y 4 ) 
A L(xf,x|,x|,2/i,2/2,2/4) 
A L(xi,X2,xf,2/i,2/2,2/3) 

Finally, the coordinate map sends a tuple (01,02,03) for pairwise distinct 01,02,03 
to 1 if i?( 3 )(ai, 02, 03), and to otherwise. 

It can be checked easily that R is not preserved by constant operations, min, max, 
minority, or majority, and by Theorem |5.4.3| there is a primitive positive definition 
of NAE in ({0, 1}; R), and CSP(({0, 1}; R)) is NP-hard. NP-hardness of (V; T) now 
follows from Theorem 15.5.61 □ 

We are now ready to prove Proposition |9. 2.3] which shows that we can in the rest 
of our classification project focus on those structures *B that contain the relations N, 
E, and 7^. 
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Proof of Proposition 19.2.31 Suppose E does not have a primitive positive 
definition in 03. We have that E consists of just one orbit of pairs in (V; E), and thus, 
since Aut(QS) D Aut((V; .£?)), also in 03. Hence, Lemma 5.3.5 shows the existence of 
an endomorphism e of 03 that violates E. 



Theorem 9.2.4 states that either all endomorphisms of 03 are generated by its 
automorphisms, or 03 has a constant endomorphism, or the endomorphism eg, or 
the endomorphism ejy. If 03 has a constant endomorphism we are in Case (a) and 
done. If 03 has the endomorphisms orejv, then we are in Case (b) since e_g;[V] and 
6 jv [V] induce structures in (V; E) which are invariant under all permutations of their 
domain. So assume in the following that 03 has neither e^, nor e^r, nor a constant as 
an endomorphism, and that all endomorphisms of 03 are generated by Aut(03). This 
implies that Aut(03) does not equal Aut((V;i?)), and hence it contains either — or 
sw, by Theorem |9.2.6| 

Suppose that Aut(03) contains — . If the relation is primitive positive defin- 
able in 03, then CSP(03) is NP-complete by Proposition 9.2.9 and Lemma 1.2.6 and 



we are in Case (c). Otherwise, since the relation P^ consists of only one orbit of 
triples in 03 and by Lemma [5.3.5[ there exists an endomorphism e\ of 03 that violates 
p( 3 > . Since ei is generated by Aut(03), there is also an automorphis m of 3 that 
violates P^ , and hence also by Proposition 9.2.7 Thus, Theorem 9.2.6 shows 
that Aut(03) contains sw. 

We can thus henceforth assume that Aut(03) contains sw. Then the relation T 
consists of just one orbit of 4-tuples with respect to 03, by Lemma |9.2.11| If the 
relation T is primitive positive definable, then we are in Case (c) and CSP(03) is 
NP-hard by Lemma 1.2.6| and Proposition |9.2.13| Otherwise, since T consists of one 
orbit in 03, Lemma |5.3.5 implies that there is an endomorphism e2 of 03 that violates 
T. Since is genera ted by Aut(03), there is an automorphism of 03 violating T, and 

there is an automorphism of 03 violating R^ . It then 
that Aut(03) contains all permutations, bringing us back 



9.2.10 



9.2.6 



thus by Proposition 
follows from Theorem 
into Case (b). 

The reasoning for the case when N has no primitive positive definition in 03 is 
dual. So we may assume that both E and N are primitive positive definable. Then 
so is ^ since x ^ y iff 3z (E(x, z) A N(y, z)). □ 



9.3. Higher Arity Polymorphisms 

In the following we assume that 03 = (V; E, N, ^, . . . ) has a first-order definition 
in (V; E) and contains the relations E, N and ^. While in the last section, we only 
dealt with endomorphisms and automorphisms of structures, the remaining cases 
require the study of higher arity polymorphisms of 03. 

We remark that since (V; E) has only binary relations, a function / : V fe — > V is 
canonical if and only if it is 2-canonical (Section 5.6.2 1. The polymorphisms that 
imply tractability of CSP(03) will be canonical with respect to (V;E). We now 
define different behaviors that some of these canonical functions might have. For 
Qi, . . . , Qk £ {E, N, —, we will in the following write Qi ■ ■ ■ Qk for the binary 



relation on 
1 < i < k. 



that holds between two fc-tuples x,y £ V iff Qi(xi,yi) holds for all 



Definition 9.3.1. We say that a binary injective operation f: V 2 — > V is 

• balanced in the first argument if for all u, v £ V 2 we have that E=(u,v) 
implies E(f(u), f(v)) and N={u,v) implies N (f (u) , f (v)) ; 

• balanced in the second argument if (x, y) i-> f(y, x) is balanced in the first 
argument; 
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• balanced if f is balanced in both arguments, and unbalanced otherwise; 

• i?-dominatcd (iV-dominated) in the first argument if for all u,v € V 2 with 
^={u,v) we have that E{f{u)J{v)) (N(f(u),f(v))); 

• i?-dominated (JV-dominated) in the second argument if (x, y) i-> f(y, x) is 
E -dominated (N -dominated) in the first argument; 

• ^-dominated (iV-dominated) if it is E-dominated (N -dominated) in both 
arguments; 

• of type min if for all u, v € V 2 with ^^(u, v) we have E{f(u), f{v)) if and 
only if EE(u, v); 

• of type max if for all u, v € V 2 with f^(it, v) we have N(f(u), f(v)) if and 
only if NN(u, v); 

• of type pi if for all u,v € V 2 with =/=^{u,v) we have E(f(u),f(v)) if and 
only if E(ui,Vi); 

• of type p 2 if (x, y) ^ f(y, x) is of type p x ; 

• of type projection if it is of type p\ or p 2 . 

Note that, for example, being of type max is a behavior of binary functions that 
does not force a function to be canonical, since the condition only talks about certain 
types of pairs in V 2 , but not all such types; however, being of type max and balanced 
does mean that a function is canonical. The next definition contains some important 
behaviors of ternary functions. 

Definition 9.3.2. An infective ternary function f : V 3 — > V is of type 

• majority if for all ii,n £ V 3 we have that E(f(u),f(v)) if and only if 
EEE(u,v), EEN(u,v), ENE(u,v), or NEE{u, v); 

minority if for all x, y € V 3 we have E(f(x), f{y)) if and only if EEE(u, v), 
NNE{u,v), NEN(u,v), or ENN(u,v). 



While the polynomial-time tractability results of this section will be shown by 
means of a number of different canonical functions, all hardness cases will be estab- 
lished by the following single relation. 

Definition 9.3.3. We define the 6-ary relation H(xi,yi, X2,y2, xs,ys) onW by 

ij'6{l,2,3},i^i,«e{xi,j/i},tiG{x ; ,-,8/ 3 } 

A ({{E( Xl , yi ) A N(x 2 , y 2 ) A N(x 3 , y 3 )) 

V (N( Xl , yi ) A E(x 2 ,y 2 ) A N(x 3 ,y 3 )) (10) 

V {N(x 1 ,y 1 )AN(x 2 ,y 2 )AE(x 3 ,y 3 ))). 

The goal of this section is to prove the following proposition, which together with 
Proposition 9.2.3 proves Theorem |9.1.4| 

PROPOSITION 9.3.4. Let *B = (V; E, N, ^, . . . ) be first-order definable in (V; E). 
Then at least one of the following holds: 

(a) There is a primitive positive definition of H in 2$. In this case, CSP(*8) is 
NP-complete. 

(b) 25 has a canonical polymorphism of type minority, as well as a canonical 
binary injection which of type p\ and E-dominated or N -dominated in the 
second argument. In this case, CSP(*B) is polynomial-time tractable. 

(c) 25 has a canonical polymorphism of type majority, as well as a canonical 
binary injection which of type p\ and E-dominated or N -dominated in the 
second argument. In this case, CSP(25) is polynomial-time tractable. 
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(d) 03 has a canonical polymorphism of type minority, as well as a canonical bi- 
nary injection which is balanced and of type projection. In this case, CSP(03) 
is polynomial-time tractable. 

(e) 03 has a canonical polymorphism of type majority, as well as a canonical bi- 
nary injection which is balanced and of type projection. In this case, CSP(03) 
is polynomial-time tractable. 

(f) 03 has a canonical polymorphism of type max or min. In this case, CSP(03) 
is polynomial-time tractable. 

The remainder of this section contains the proof of Proposition |9.3.4 except for 

For the 



the polynomial-time tractability proofs, which will be given in Section 9.4 



other statements of Proposition 9.3.4 we proceed as follows. In Section 9.3. 1| we 
show that the relation H is hard. In particular, a structure 03 = (V; E, N, ^, . . .) 
with a first-order definition in (V; E) must have an essential polymorphism, or has a 
finite reduct with an NP-hard CSP. In Section [9~3T2l we show when 03 has an essential 
polymorphism, it must also have a binary injective polymorphism. We finally prove 
in Section 9.3.4 that 03 has one of the polymorphisms listed in cases (b) to (f) of the 
proposition. 

9.3.1. Hardness of H. This section is devoted to case (a) of Proposition 9.3.4| 

Proposition 9.3.5. There is a primitive positive interpretation of ({0, 1}; HNS) 
in (V;H), and CSP((V;ff)) is NP-hard. 

PROOF. We give a 2-dimensional interpretation I of ({0, 1}; 1IN3) in 03. The 
domain formula is true. The formula =/ (xi, x 2 , yi, y 2 ) is 

3zi, z 2 , ui,u 2 , vi,v 2 (H(xi,X2,ui,u 2 , z\,z%) A N(ui,u 2 ) 

A H(z 1 ,z 2l vi,v 2 ,yi,y 2 ) A N(vi,v 2 )) 

This formula is equivalent to a primitive positive formula over 03 since N(x, y) is 
primitive positive definable by H. The formula 1IN3i{x\, x 2 , j/i, y 2l Z\, z 2 ) is 

x 2l y 1: y 2 , z l7 z 2 (H(x 1 , x 2 , y l7 y 2 , z±, z 2 ) 

A =1 (x 1 ,x 2 ,x' 1 ,x 2 )A =1 (yi,y2,y[,y' 2 )A =1 (z 1 ,z 2 ,z[,z 2 )) 

The coordinate map sends a tuple (xi,x 2 ) to 1 if E(xi,x 2 ) and to otherwise. The 
second part of the statement follows from Corollary |5.5.8| □ 

9.3.2. Producing binary injections. We now show that if a structure 03 = 
(V; E, N, 7^, . . .) with a first-order definition in (V; E) has an essential polymorphism, 
then 03 must also have a binary injective polymorphism. This is in particular the 
case when the relation H from the previous section is not primitive positive definable 
in 03: since E and N are among the relations of 03, and since any essentially unary 
polymorphism preserving both E and N preserves all relations with a first-order 
definition in (V; E), we have that the polymorphism violating H must be essential. 

Theorem 9.3.6 (from [46]). Let 53 = (V; E, N, ^, . . . ) be first-order definable in 
(V; E), and suppose that 03 has an essential polymorphism. Then 03 also has a binary 
injective polymorphism. 

Proof. Let /: V fc — s- V be an essential polymorphism of 03 of minimal arity. By 
Lemma |5.3.10[ / must be binary. Hence, we may apply Lemma |6.1.3| to 03, and in 
order to show that 03 is preserved by a binary injection, it suffices to show that if 4> 
is a primitive positive formula over 03 such that both 4> A x 7^ y and <f> A s 7^ t are 
satisfiable over 03, then cf>Ax^yAs^tis satisfiable over 03 as well. The proof 



follows the idea of the proof of Theorem 6.3.1 
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Let <j> be a primitive positive formula over the signature of 23 such that 

• there is a tuple t\ that satisfies <fi A x ^ y 

• there is a tuple t 2 that satisfies (j> A s 7^ t. 

Let ai, a 2 , 03, a.4 and 61, 62, 63, 64 be the values for x, y, s, t in ti and £2, respectively. 
We have a\ 7^ 02 and 63 7^ 64. We want to show that 4>Ax^yAs^tis satisfiablc 
over 23. Thus, if a 3 7^ a 4 or 61 7^ 6 2j there is nothing to show, and so we assume that 
a 3 = a4 and 61 = b 2 . 

We claim that there are automorphisms a, (3 of (V;E) such that in the tuple 
i 3 := f(a(ti), f3(t 2 )) the value of a; is different from the value of y, and the value of 
s is different from the value of t. Then, since / preserves 23, the tuple £3 shows that 
(j)Ax^yAs^tis satisfiable over 23, and concludes the proof. 

To prove the claim, we will find tuples c :— (01,02,03,04) and d :— {di^d 2l d 3l di) 
of the same type as (ai, a 2 , 03, 04) and (61, 62, 63, 64), respectively, such that the tuple 
o := f(c,d) satisfies e\ 7^ e 2 and e 3 7^ e 4 . Then, by the homogeneity of (V;E), we 
can find automorphisms a and /? of (V; E) sending a to c and 6 to d, which suffices 
for the proof of our claim. 

In the sequel, we will assume that X(ai,a 2 ) and Y(6 3 , 64), where X, Y G {E, N}. 

Case 1. Suppose first that 03 = a 4 G {&i,a2} and 61 = b 2 G {63,64}; without 
loss of generality, 03 = a 2 and 61 = 63. 

Case 1.1 There exists u G V such that for all p, i> G V with (u,v) € Y we 
have f(p,u) = f{p,v). Then, because / preserves 7^, we have f(p,u) 7^ f(q,u) for 
all p ^ q. If for all p, 1; G V we have that f(p,u) = f(p,v), then this implies that 
for all p,v,v' € V we have that }{j>,v') = f(p,v), contradicting our assumption 
that / is essential. So there are p, v G V such that f(p,u) 7^ f(p,v). Pick w G V 
such that (w,u),(w,v) G Y. Pick moreover ijeV such that (p, g) G X. We have 
7^ /(P,«) = /few). Moreover, /(p,w) = f(p,u) ^ f(q,u) = f{q,w). Hence, 
the tuples c := (q,p,p,p) and d := (10, w, w, v) prove the claim. 

Case 1.2 For all u G V there exist p, n G V with (u, u) G Y such that /(p, it) 7^ 
f(p,v). Pick m,n,u € V with (m, n) € X and /(to, u) 7^ f(n,u). Pick p, u G V 
such that (u, v) G Y and f(p,u) 7^ f(p,v). If we can pick p in such a way that 
(p,m),(p,n) G X, then since either f(m,u) 7^ f(p,u) or f(n,u) 7^ f(p,u) we have 
that either (m,p,p,p) or (n,p,p,p) proves the claim together with the tuple (it, u, u, w). 
So suppose that this is impossible. Then for any q G V with (<?, to), (<?, n) G X we have 
f(q,u) = f(q,v) 7^ f(p,u), so we have that (q,p,p,p) and (u,u,u,v) satisfy the claim. 

Case 2. Now suppose that a 3 = 04 G {ai,a 2 } and 61 = 6 2 ^ {63,64}; wlog 
a 3 = a 2 . Write (61,63) G Q 3 and (61,64) G Q4, where Q 3 ,Q 4 G {E 1 , N}. 

Case 2.1 There exists it G V such that for all p,v,r with (u, r) G Y, (ii,i>) G Q3 
and (u,r) G Q4 we have f(p,v) = f(p,r). Then one easily concludes that for all 
peV and all v,v' G V with v,v' ^ u we have f(p,v) = f(p,v'). This implies that 
/(p, i>) 7^ f(q, v) whenever p 7^ q and d/h. Since / is essential, there exist p, v G V 
with (u, w) G Y such that /(p, 11) 7^ /(p, i>). Now pick m,?eV such that (u>, it) G Q3, 
(iu,v) G Q4, and (q,p) G X. Then f(p,w) 7^ f(q,w), and so the tuples (q,p,p,p) and 
(w,w,u,v) prove the claim. 

Case 2.2 For all u there exist p, v, r with (u, r) G Y, (u, w) G Q3, (u, r) G Q4 and 
/(p> u ) 7^ /(Pi r )- Pick TO > n , u with (to, n) G X and /(to, u) 7^ /(n, w). Pickp, u,reV 
such that (w, r) G Y, (u, w) G Q3, (u, r) G Q4 and /(p, w) 7^ /(p, r). If we can pick p in 
such a way that (p, to), (p, n) G X, then either (m,p,p,p) and (w, it, f,r) or (n,p,p,p) 
and (u,u 7 v 7 r) prove the claim. So suppose that this is impossible. Then for any q 
with (g, m), (q, n) G X and all v,r G V with (u,r) G Y, (u, u) G Q3, (u,r) G Q4 
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we have f(q,v) — f(q,r). This implies that for all such q and all v,v' 7^ u we have 
f(q,v) = f(q,v'). Pick w such that (w, v) G Q3, (w,r) G Q4. Pick q such that 
(q,p) G X. We have f(q,w) 7^ f(p,w), and so {q,p,p,p) and (w, w,v,r) prove the 
claim. 



Case 3. To finish the proof, suppose that 03 = 04 ^ {01,(12} and 61 = 62 ^ 
{63,64}. Write (a 3 ,a!) £ Pi, (a 3 ,a 2 ) G P 2 , (bi,h) G Q 3 and (64,64) £ Q4, where 
P,Q, G {P,^}. 

Case 3.1 There exists u such that for all p,v,r with (v,r) G Y, (u,v) G Q3 
and G Q4 we have f(p,v) = f{p 1 r). Then one easily concludes that for all 

p G V and all v, v' G V with v,v' 7^ u we have f(p,v) = f(p,v'). This implies that 
f(p,v) 7^ f(q,v) whenever p 7^ q and v ^ u. We claim that there exist p, v with 
(u, v) G y such that /(p, m) 7^ /(p, w). Otherwise, if /(p, -u) = f(p, v) for all p, V, then 
f(p, v) = f(p, v 1 ) for all p, v, v' , and / depends only on its first variable, contradicting 
the assumption that / is essential. Now pick w,m,n such that (u>,u) G Q3, (u>,v) G 
Q4, (m,n) G X, {m,p) G Pi, and (n,p) G P2. Then the tuples (m,n,p,p) and 
(u>, w, m, u) prove the claim. 

Case 3.2 For all u there exist p,v,r with (i>,r) G Y, (w,t>) G Q3, (it, r) G Q4 
and f(p,v) 7^ f{PiT). Pick m,n,u with (m, n) G X and f(m,u) 7^ f(n,u). Pick 
p,v,r such that (f,r) G Y, G Q3, (u, r) G Q4 and f(p,v) 7^ f(p,r). If we 

can pick p in such a way that (p,m) G Pi and (p, n) G P2, then (m,n,p,p) and 
(u,u,v,r) prove the claim, so suppose that this is impossible. Then for any q with 
(q,m) G Pi and (q, n) G P2 and all v, r with (v,r) G Y, (u,v) G Q3, (u,r) G Q4 we 
have f(q, v) = f(q, r). This is easily seen to imply that for all such q and all v, v' 7^ u 
we have f(q, v) = f(q, v'). Pick w such that (w, v) G Q3, (u>, r) G Q4, and w ^ u. Pick 
<?,</ such that (g, g') G X, (q,p) G Pi and (q',p) G P2. We have f{q,w) 7^ f(q',w), 
and thus (q,q',p,p) and (w,w,v,r) prove the claim. □ 

9.3.3. Minimal Binary Functions. Let ^ be the clone generated by Aut((V; E)). 
We know from Theorem 19.2.41 and Theorem 19.3.61 that all essential functions that are 
minimal above ^ are binary injective, and preserve both E and TV. It is the goal of 
this section to determine these binary minimal functions. To state the main result, 
we define the dual of an operation / on (V; E), which can be imagined as the function 
obtained from / by exchanging the roles of E and N. 

Definition 9.3.7. The dual of a function f(xi, . . . ,x n ) on (V;P) is the function 
-f(-xi,...,-x n ). 

Theorem 9.3.8 (from [46]). 7/53 = {¥; E, N,^, . . .) is first- order definable in 
(V; E) and has an essential polymorphism, it must also have at least one of the fol- 
lowing binary injective canonical polymorphisms. 

• a balanced operation of type pi; 

• a balanced operation of type max; 

• an E-dominated operation of type max; 

• an E-dominated operation of type p\; 

• a binary operation of type p\ that is balanced in the first and E-dominated 
in the second argument; 

or one of the duals of the last four operations (the first operation is self- dual). 

Our proof of Theorem |9.3.8| makes essential use of the Ramsey techniques from 
Chapter [8] As we have seen in Example |8.1.6[ the class of all finite graphs is not 
a Ramsey class. However, the class of all finite ordered graphs is a Ramsey class 



(see Example 8.1.7). This class is clearly an amalgamation class, and we denote its 
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Fraisse-limit by (V;E, <). Note that the reduct of this structure without the order 
has the extension property, and hence is isomorphic to the random graph. It therefore 
makes sense to use the same symbol V for the elements of (V; E, <) and the elements 
of the random graph. Also note that the reduct (V; <) is isomorphic to (Q; <). By 



the argument above, the following is a direct consequence of Corollary 8.3.6 



COROLLARY 9.3.9. Every essential function that is minimal above the clone gen- 
erated by Aut((V;£J)) is a binary injection that is canonical as a function from 
(V;£,<) 2 to (¥;£,<). 

In the rest of this section, canonical means canonical as a function from (V; E, <) 2 
to (Y;E,<), and minimal means minimal as an operation above ^ . The following 
behavior of functions from (V;E, <) 2 — > (Y;E,<) is useful to describe canonical 
functions. 

Definition 9.3.10. Let f: V 2 -> V, and let {Ri,R 2 } = {E,N}. If for all 
{xi,x 2 ), (2/1,2/2) G V 2 with x x < yi, x 2 < y 2l Ri(xi,Vi), and R 2 (x 2 ,y 2 ) we have 

• N{f{x\,x 2 ) 1 f(yi,y 2 )), then we say that f behaves like min on input (<, <). 

• E(f(xi, x 2 ), f(yi, y 2 )), then we say that f behaves like max on input (<, <). 

• Ri(f(x\, x 2 ), f(yi,y 2 )), then we say that f behaves like p\ on input (<, <). 

• R2(f(xi,x 2 ), f(yi,y 2 )), then we say that f behaves like p 2 on input (<, <). 
Analogously, we define behavior on input (<, >) using pairs (x\, X2), (j/i, J/2) <= V 2 
with x\ < yi and x 2 > y 2 . 

Of course, we could also have defined "behavior on input (>,>)" and "behavior 
on input (>, <)"; however, behavior on input (>, >) equals behavior on input (<, <), 
and behavior on input (>, <) equals behavior on input (<, >). Thus, there are only 
two kinds of inputs to be considered, namely "straight input" (<,<) and "twisted 
input" (<,>)• 

Proposition 9.3.11. Let /: V 2 — > V be infective and canonical, and suppose 
it preserves E and N. Then it behaves like min, max, p\ or p 2 on input (<,<). 
Moreover, it behaves like on min, max, p\ or p 2 on input (<,>). 

Proof. By definition of the term canonical; one only needs to enumerate all 
possible types of pairs x, y E V 2 . □ 

We remark that the four possibilities correspond to the four binary operations g on 
the two-element domain {E, N} that are idempotent, i.e., that satisfy that g{E,E) = 
E and g(N, N) = N. 

Definition 9.3.12. // /: V 2 -» V behaves like X on input (<, <) and like Y on 
input (<, >), where X, Y G {max, min,pi,p 2 }, then we say that f is of type X/Y. 

Fix an automorphism f-> of the graph (V; E) that reverses the order on V; 
such an automorphism clearly exists since (V; E, <) and (V; E, >) are isomorphic. 



Lemma 8.3.15 shows that any canonical binary injective polymorphism of (V; <) has 
the same behavior as lex(x, y), lex(x, -O-y), lex(y, x), or lex(y, Ox). If / is any function 
that is canonical as a map from (V; <; E) 2 to (V; <), and does not preserve <, then 
(x,y) i—)- ^-f(x,y) preserves <. Moreover, by passing from / to (x, y) t-t f(-^x,y) or 
(x,y) i—)- f{^x,y) we can assume that / behaves either like lex(x,y) or lex{y,x). 

We will now prove that minimal binary canonical injections are never of mixed 
type, i.e., they have to behave the same way on straight and twisted inputs. 

Lemma 9.3.13. Suppose that f: V 2 — > V is injective and canonical, and suppose 
that it is of type max/pi or of type pi/ max, where i £ {1,2}. Then f is not minimal. 
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Proof. We prove that / generates a binary injective canonical function g which 
is of type max/max. Clearly, all binary injective canonical functions generated by g 
then are also of type max /max, so g cannot generate /, proving the lemma. 

Assume without loss of generality that / is of type max/pi, and note that we 
assume that / behaves like lex(x,y). Set h(u,v) := Then h behaves like 

Pi on input (<, <) and like max on input (<, >); moreover, f(x\,x 2 ) < /(j/1,2/2) iff 
h(xi,x 2 ) < h(yi,y 2 ), for all x x 7^ y x and x 2 7^ y 2 . We then have that g(u,v) := 
f(f(u, v),h(u, v)) is of type max/max, finishing the proof. □ 

Lemma 9.3.14. Suppose that /: V 2 — > V is injective and canonical, and suppose 
that it is of type min/pi or of type pi/min, where i € {1, 2}. Then f is not minimal. 

Proof. The dual proof works. □ 

Lemma 9.3.15. Suppose that f: V — > V is injective and canonical, and suppose 
that it is of type max/min or of type min/max. Then f is not minimal. 

PROOF. Assume without loss of generality that / is of type max/min, and recall 
that we assume that / behaves like lex(x,y). Consider h(u,v) :~ f(f(u,v),^v). 
Then h is of type p 2 /p 2 , so it cannot reproduce /. □ 

Lemma 9.3.16. Suppose that f : V 2 — > V is injective and canonical, and suppose 
that it is of type pi/p 2 or of type p 2 /p\. Then f is not minimal. 

Proof. If / is of type pi/p 2 , then h(u, v) := f(f(u, «),Hu) is of type p 2 /p 2 and 
cannot reproduce /. If / is of type p 2 /p\, then g(u,v) := /(u,f>i)) is of type pi/p 2 
and still behaves like lex(x,y); hence, we are back in the first case. □ 

We have seen that actually no "mixed" types appear for minimal functions. In 
other words, minimal functions that are canonical as functions from (V;E,<) 2 — > 
(V;E, <) are also canonical as functions from (V;E) 2 — > (V;E). This motivates the 
following definition. 

Definition 9.3.17. Let f: V 2 -> V. We say that f behaves like min (max, p 1 , 
p 2 ) on input (7^, 7^) iff it behaves like min (max, pi, p 2 ) both on input (<, <) and on 
input (<,>). We also say that f is of type min (max, p\,p 2 ). If f is of type pi or 
p 2 then we also say that f is of type projection. 

Our observations so far can be summarized as follows. 

Proposition 9.3.18. Let f : V 2 -> V be essential and minimal. Then it is injec- 
tive, canonical as a function from (Y;E) 2 — » (V;E), and behaves like min, max, pi 
or p 2 on input (7^,7^). 

In the following, we consider further types of tuples x, y e V 2 . So far, we did not 
consider the case where x\ — y\ or x 2 = y 2 . 

Definition 9.3.19. Let f: V 2 ->• V. We say that f behaves like e E (ejv, id, -) 
on input (7^,=) iff for every fixed c£ V, the function g(x) := f(x,c) behaves like e E 
(eN, id, —). Similarly we define behavior on input (=,7^). 

If / is canonical and injective, then it behaves like one of the mentioned functions 
on input (7^, =) and (=, 7^), respectively. This motivates the following. 

Definition 9.3.20. We say that f:Y 2 ^Yis of type E/N iff f behaves like 
e E on input (7^,=) and like e^ on input (=,7^). Similarly we define the types E/E, 
N/E, E/id, E/—, etc. Moreover, we say that f is balanced iff it is of type id /id, 
we say it is i?-dominated iff it is of type E/E, and we say it is iV-dominatcd iff it is 
of type N/N. 
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In the following theorem, we finally characterize those canonical behaviors that 
yield minimal functions. 

Theorem 9.3.21. The minimal polymorphisms of (V; E, N) are precisely the bi- 
nary injective canonical operations of the following types: 

(1) Projection and balanced. 

(2) max and balanced. 

(3) min and balanced. 

(4) max and E-dominated. 

(5) min and N -dominated. 

(6) Projection and E-dominated. 

(7) Projection and N -dominated. 

(8) P2 and Ej id, or pi and id /E. 

(9) p2 and N/ id, or pi and id /N. 

Moreover, these 9 different kinds of minimal functions do not generate one another, 
and any two functions in the same group do generate one another. 



If e is essential, then it must be binary and injective by Theorem 9.3.6 The rest 



of the theorem follows from Proposition |9.3. 18| and the following lemmas. 

Lemma 9.3.22. If f is a binary canonical injection in one of the classes of The- 



9.3.21, then it generates all other functions in the same class. 



Proof. By the homogeneity of (V;E) and local closure. □ 

The verification of Lemmas |9 .3 . 23| to |9 . 3. 26| is left to the reader; the proof always 
uses induction over terms. 

Lemma 9.3.23. Any binary essential function generated by a binary canonical 
injection of type min, max, or projection, respectively, is of the same type. 

Lemma 9.3.24. Any binary essential function generated by a binary canonical 
injection that is balanced and preserves E and N is balanced. 

We thus have that the first three classes of functions of Proposition |9.3.21| arc 
indeed minimal. The following lemma proves minimality for items (4) and (5). 

Lemma 9.3.25. Any binary essential function generated by an E-dominated bi- 
nary canonical injection of type max is E-dominated. Dually, any binary essential 
function generated by an N -dominated binary canonical injection of type min is N- 
dominated. 

The following lemma proves minimality for items (6) and (7). 

Lemma 9.3.26. Any binary essential function generated by an E-dominated bi- 
nary canonical injection of type projection is E-dominated. Dually, any binary essen- 
tial function generated by an N -dominated binary canonical injection of type projec- 
tion is N -dominated. 

It remains to prove minimality for items (8) and (9), which is achieved in the 
following lemma. 

Lemma 9.3.27. Any binary essential function generated by a binary canonical 
injection of type E / id and pi is either of the same type or of type id/E and p\. 
Dually, any binary essential function generated by a binary canonical injection of 
type N / id and P2 generates is either of the same type or of type id /N and p\. 
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Proof. Let f{u,v) be of type E/id and p 2 . f( v ,u) is of type id/E and p\. 
Both f(u, f(u, v)) and f(v, f(u, v)) are of type Ej id and pi. So is f(f(u, v),v). The 
function f(f(u,v),u) is of type id/E and pi. Finally, f(f(u,v),f(v,u)) also is of 
type id/E and p\, so f cannot generate any new behaviors. □ 

Next we claim that no other functions except for those listed in Theorem |9.3.21| 
are minimal. This will be achieved in the following lemmas. 

Lemma 9.3.28. Let f be a binary canonical injection of type max. If f is not 
balanced or E-dominated, then f is not minimal. 

Proof. If / is of type E/id, then g(x,y) := f(f(x,y),x) is _E-dominated. By 



Lemma 9.3.25 g cannot reproduce /. If / is of type E/N , then g is _E-dominated as 
well. So it is if / is of type Ej—. 

If / is of type N / id, then g(x, y) := f(x, f(x, y)) is balanced, so / is not minimal 



by Lemma 9.3.24| If / is of type N/—, then g is balanced as well. 



If / is of type id /- or of type -/-, then g(x, y) := f(x, f(x, y)) is of type E/ id, 
which we have already shown not to be minimal. 

By symmetry, if we switch the arguments in a type of /, e.g., if / is of type id /E, 
then / is not minimal either. We have thus covered all possible types. □ 

Analogously, we find that every minimal binary injection of type min is balanced 
or A^-dominated. 

Lemma 9.3.29. Let f be a binary canonical injection of type p%. If f is not 
balanced, E-dominated, N -dominated, of type id / E, or of type id /N , then f is not 
minimal. 

Proof. If / is of type E/id, E/-, -/id, or -/-, then g(x,y) := f(x,f(x,y)) 
is balanced and cannot reproduce /. If it is of type E/N or id /— , then g is of type 
E/ id, and we are back in the preceding case. Dually, if / is of type N / id or N /— , 
then g is balanced. If it is of type NJ E, then g is of type N/ id, bringing us back to the 
preceding case. If it is of type — / E, then g is of type id/E and p%, and hence cannot 



reproduce / by Lemma 9.3.27 The dual argument works if / is of type —/N. □ 



Analogously, we find that every minimal binary injection of type P2 is balanced, 
-E-dominated, ./V-dominated, or of type E/ id. 

9.3.4. Producing functions that are not of type projection. Theorem |9.3.8| 

and the following proposition together imply that indeed, if case (a) of Proposi- 
tion |93]4] does not apply, then one of the other cases does. 

Proposition 9.3.30. Suppose that f is an operation on V that preserves the 
relations E and N and violates the relation H . Then f generates a binary injective 
canonical operation of type min or max, or a ternary injective canonical operation of 
type minority or majority. 

We first prove the following. 

Lemma 9.3.31. Let f be an operation on (Y;E) which preserves E and N and 
violates H . Then f generates a binary or ternary injection which shares the same 
properties. 

Proof. Since the relation H consists of three orbits of 6-tuples, by Lemma [5. 3. 5| 
/ generates an at most ternary function that violates H, and hence we can assume 
without loss of generality that / itself is at most ternary. The operation / must 
certainly be essential, since essentially unary operations that preserve E and N also 
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preserve H. Applying Theorem |9.3.8| we get that / generates a binary injective 
canonical function of type min, max, or p\. In the first two cases we are done, since 
binary injections of type min and max violate H . So consider the last case and denote 
the function of type pi by g. By adding a dummy variable, we may assume that / is 
ternary. Now consider 

h(x, y, z) := g(g{g(f(x, y, z),x),y), z) . 

Then h is clearly injective, and still violates H - the latter can easily be verified 
combining the facts that / violates H, g is of type p\, and all tuples in H have 
pairwise distinct entries. □ 



It will turn out that just as in the proof of Lemma |9.3.31 there are two cases for 



/ in the proof of Proposition |9.3.30| either all binary canonical injections generated 
by / are of type projection, and / generates an edge majority or an edge minority, or 
/ generates a binary canonical injection of type min or max. We start by considering 
the first case, which is combinatorially less involved. 

Proposition 9.3.32. Let f be an operation on (V; E) which preserves E and N 
and violates H. Suppose moreover that all binary injections generated by f are of 
type pi or p2 ■ Then f generates a canonical ternary injection of type majority or 
minority. 

PROOF. By Lemma |"9.3.3I| we can assume that / is a binary or ternary injection; 
if it was binary, it would be of type projection and thus preserve H, so it must be 
ternary. Because / violates H, there are x 1 , x 2 , x 3 £ H such that (f(xi), . . . , f(xe)) £ 
H, where Xi := (xj,x 2 ,x 3 ) for 1 < i < 6. 

If there was an automorphism a such that a{x l ) = x 3 for i ^ j < 3, then / gen- 
erates a binary injection that still violates H , which contradicts the assumption that 
all binary injections generated by / are of type projection. By permuting arguments 
of / if necessary, we can therefore assume without loss of generality that 

E{x\,x\),N{x\,x\),N{x\,xl), 

N(xlxl),E(xixl),N(xixl), 

N(xl,x 3 2 ),N(xl,xl),E(xl,xl). 

We set 

S:={y£Y 3 \ NNN(x l} y) for all i < 6} . 

Consider the binary relations Q1Q2Q3 on V 3 , where Qi £ {E,N} for 1 < i < 3; 
each of these relations defines a 2-type in (V; E) I 3 ' . We claim that for every 2-type 
t defined by one of those relations there is a 2-type s of (V; E) such that / satisfies 
the type condition (s, t) on S. To prove the claim, fix a relation Q1Q2Q3 and let 
u,v £ S be such that QiQ2Qs(u, v) holds; we must show that whether E(f(u), f(v)) 
or N(f(u),f(v)) depends only on Q1Q2Q3 (and not on u, v). We go through all 
possibilities of QiQ2Qs- 

(1) Q1Q2Q3 = ENN. Let a £ Aut((V;£)) be such that a(x\ , x\, u 2 , v 2 ) = 
(xf , xf,, U3, V3); such an automorphism exists since NNN(xi,u), NNN(xx,u), 
NNN(x2,u), NNN(x2 7 v), and since (x 2 ,x?,) has the same type as (x^x 3 ,), 
and (^2,^2) has the same type as (113,1)3). By assumption, the operation 
g defined by g(x,y) := f(x,y,a(y)) must be of type projection. Hence, 
E{g(ui,u 2 ),g{vi 1 v 2 )) iff E(g(xi,x 2 ), g{x\, x 2 )). Combining this with the 
equations (f(u),f(v)) = (g(u 1 ,u 2 ), g(vi,v 2 )) and {g(x\,x\),g(x\,xl)) = 
{f{xi),f{x2)), we get that E(f(u),f (v)) iff E(f(x 1 ),f(x 2 )), and so we are 
done. 
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(2) Q1Q2Q3 = NEN or Q1Q2Q3 — NNE. These cases are analogous to the 
previous case. 

(3) Q1Q2Q3 — NEE. Let a be defined as in the first case. By assumption, the 
operation defined by f(x,y,a(y)) must be of type projection. Reasoning as 
above, one gets that E(f(u),f(v)) iff N(f(xi),f(x 2 )). 

(4) Q1Q2Q3 = ENE or Q1Q2Q3 — EEN. These cases are analogous to the 
previous case. 

(5) Q1Q2Q3 = EEE or Q1Q2Q3 = NNN. These cases are trivial since / 
preserves E and TV. 

To show that / generates an operation of type majority or minority, it suffices 
to prove that / generates a function of type majority or minority on S (that is, has 
on S the same behavior as a function of type majority), since S contains copies of 



arbitrary finite products of substructures of (Y;E), and by Lemma 8.3.13 We prove 
this by another case distinction, based on the fact that (/(xi), . . . , J{xq)) ^ H. 

(1) Suppose that E(f(x 1 )J(x 2 )),E(f(x 3 ),f(x i )),E(f(x 5 )J(x 6 )). Then / it- 
self is of type minority on S. 

(2) Suppose that N(f( Xl ), f(x 2 )), N(f(x 3 ), /(a*)), N(f(x 5 ), f(x e )). Then / it- 
self is of type majority on S. 

(3) Suppose that E(f(x 1 ),f(x 2 )),E(f(x 3 )J(x 4 )),N(f(x 5 )J(x 6 )). Let e be a 
self-embedding of (V; E) such that for all w S V and all i < 6 we have that 
N(xi,e(w)). Then the ternary operation defined by f(x,y,e(f(x,y,z))) is 
of type majority on S. 

(4) Suppose that E(f( Xl ), f(x 2 )), N(f(x 3 )J(x 4 )), and E(f(x 5 ), f(x 6 )), or that 
N(f(xi),f(x 2 )), E(f(x 3 ),f(x 4 )), and E(f(x 5 )J(x 6 )). These cases are 
analogous to the previous case. 

Let h(x,y,z) be a ternary injection of type majority or minority generated by /; it 
remains to make h canonical. By Theorem |9.3.8| / generates a binary canonical injec- 
tion g(x, y), which is of type projection by our assumption on /. Assume without loss 
of generality that it is of type pi and set t(x, y, z) := g(x 1 g(y, z)). Then the function 
h(t(x, y, z),t(y, z, x), t(z, x, y)) is still of type majority or minority and canonical; we 
leave the straightforward verification to the reader. □ 

In order to obtain a full proof of Proposition |9.3.30| it remains to show the 
following proposition. 

Proposition 9.3.33. Let f: V 2 — >• V be a binary injection preserving E and N 
that is neither of type p\ nor of type p 2 • Then f generates a binary injection of type 
min or of type max. 

In the remainder of this section we will show this by a Ramsey theoretic analysis 
of /. The global strategy behind what follows now is to take a binary injection / 
and fix finitely many constants c from V 2 on which it can be seen that / is not of 
type projection. Then, using Theorem |8.3.II| we generate a binary canonical function 
which is identical with / on all tuples with elements from c; this canonical function 
then still is not of type projection, and can be handled more easily because it is 
canonical. To reduce the number of cases that we have to consider, we rule out some 
behaviors of canonical functions already before introducing the constants. 

Lemma 9.3.34. Suppose that f: V 2 — > V is injective and canonical, and suppose 
that it is of type max/g or of type g/max, where g € {min,pi,p2}. Then f generates 
a binary injection of type max. 
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Proof. Assume without loss of generality that / is of type max/g (when / is of 
type g/max, replace / by f(x, f-J-y), which is of type g/max). We also assume that / 
obeys p\ for the order (otherwise, continue with f(y,x) instead of f(y,xj). 

Set h(u, v) := f(u, ^v). Then h behaves like g on input (<, <) and like max on 
input (<,>); moreover, f(x il x 2 ) < / (2/1,2/2) \fi-h(x 1 ,x 2 ) < h(y 1 ,y 2 ), for all x x ^2/1 
and x 2 7^ £/2- We then have that f(f(u, v), h(u, v)) is of type max/max, which means 
that it is of type max. □ 

Lemma 9.3.35. Suppose that f : V 2 — > V is injective and canonical, and suppose 
that it is of type min/pi or of type pi/min, where i G {1,2}. Then f generates a 
binary injection of type min. 

PROOF. The dual proof works. □ 

We next consider the last remaining mixed behavior, pi/p 2 , by combining oper- 
ational with relational arguments. 

Lemma 9.3.36. Let 03 be a structure that is first- order definable in (V;E), con- 
tains the relations E, N, and is preserved by a binary injection of type p\. Then 
the following are equivalent. 

(1) 03 has a binary injective polymorphism of behavior min. 

(2) For every primitive positive formula <j) over*B, if <t>/\N(xi, x 2 )A/\ 1<i< j <i x i ^ 
Xj and <p A N(x 3 ,x i ) A Ai<i<j<4 x i x j are satisfiable over 23, then </> A 
N(xi,x 2 ) A N^x^^i) is satisfiable over 03 as well. 

(3) For every finite F C V 2 there exists a binary injective polymorphism of 03 
which behaves like min on F. 

Proof. The implication from (1) to (2) follows directly by applying a binary 
injective polymorphism of behavior min to tuples r,s satisfying <f> A N{x\,x 2 ) A 
f\i<i<j<4 x i ^ x j and <M N(x 3 ,x 4 ) A Ai< J <j<4 :E i ^ x h respectively 

To prove that (2) implies (3), assume (2) and let F C V 2 be finite. Without 
loss of generality we can assume that F is of the form {ei, . . . , e„} 2 , for sufficiently 
large n. Let 21 be the structure induced by F in 03 2 . We construct an injective 
homomorphism h from A to 03; every homomorphism can clearly be extended to a 
binary polymorphism of 03, for example inductively by using universality of (Y;E). 
We construct h in such a way that the extension behaves as min on F. 

To construct h, consider the formula <j>o with variables Xij for 1 < i, j < n which 
is the conjunction over all literals R(x il j 1 , . . . , x ik Jfc ) such that R is a relation in 
03 and R(e il , . . . , e ik ) and i?(e J1 , . . . ,e J(c ) hold in 03. So (j) states precisely which 
relations hold in 03 2 on elements from F. Since 03 is preserved by a binary injection, 
we have that (f>i := </> A l\i<i,j,k,l<n,(i,j)^{k,l) x i,o ^ x k,l is satisfiable. 

Let P be the set of pairs of the form i 2 ), {ji,j 2 )) with i\, i 2 .j\,j 2 G {!...., n}. 
h 7^ jii ii 7^ 32, and where N(a 1 ,ej 1 ) or N(ei 27 ej 2 ). We show by induction on the 
size of / C P that the formula 4>i/\A((i 1 i 2 ) (j ± j 2 ))ei N(xi 1 j 2 ,Xj 1 j 2 ) is satisfiable over 
03. Note that this statement applied to the set / = P gives us the a homomorphism 
h from 21 to 03 such that for all a, b G F we have N(h(a),h(b)) whenever EN(a,b) 
or NE(a,b) by setting h(ei,ej) :— s(xij), where s is the satisfying assignment for 

01 A A((i 1 ,i 2 ),(j 1 ,j 2 ))ep N i x ii,i2> x ji ,h)- 

For the induction beginning, let p — ((ii,i 2 ), {31,32)) be any element of P. Let 
r, s be the n 2 -tuples defined as follows. 



r := (ei, . . . , ei, e 2 , . . . , e 2 , . . . ,e„, 
s := (ei,e 2 , . . . ,e„,ei,e 2 , . . . ,e„, . 



j • • • j Cn) 

...,ei,e 2 ,...,e n ) 
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In the following we use double indices for the entries of n 2 -tuples; for example, r = 
(7*1,1, . . . , ri.„, 7*2,1, . . . , r rl ,„). The two tuples r and s satisfy <pQ. To see this, observe 
that by definition of tf>o the tuple 

((ei, ei), . . . , (ei, e„), (e 2 , ei), . . . , (e„, e„)) 

satisfies 4>q in OS 2 ; since r and s are projections of that tuple onto the first and second 
coordinate, respectively, and projections are homomorphisms, r and s satisfy 4>o as 
well. Let j be a binary injective polymorphism of 03 which is of type px, and set 
r' := g(r, s) and s 1 := g(s,r). Then r' and s' satisfy (f>i since g is injective. Since 
p E P, we have that N(ei 1 ,ej 1 ) or N(ei 2 , e j2 ). Assume that N(ei ± , e^); the other 
case is analogous. Since , 2 = e^, r^j., = e^, r' :— g(r, s), and g is of type pi, we 
have that ^(r^ ^.r'^ ^), proving that (px A N{x ilt i 2 ,Xj lt j 2 ) is satisfiable in 03. 

In the induction step, let / C P be a set of cardinality n > 2, and assume that 
the statement has been shown for subsets of P of cardinality n — 1. Pick any distinct 
Qi, 92 € /. We set 

ip '■= 4>\ A /\ N(x ilti2 ,x jlth ) 

(.{h,i2),(h,j2))&l\{qi,q2} 

and observe that -0 is a primitive positive formula over 03 (here we use the as- 
sumption that 03 contains the relations N and 7^). Write gi = ((ui, ^2)1 ( u i> V2)) 
and (?2 = ((w'x, m 2 ), (v'i, v' 2 )). Then the inductive assumption shows that each of 
ip A N(x UljU2 , x Vl;V2 ) and ip A N(x u > y , x v i y ) is satisfiable in 03. Note that ?/■ con- 
tains in particular conjuncts that state that the four variables x Ul . U2 , x Vl ^ V2 , x u < y , 
x v' 1 ,v' 2 denote distinct elements. Hence, by (2), the formula ip A N(x MltU2 ,x Vl , V2 ) A 
N(x u i y , x v i y ) is satisfiable over 03 as well, which is what we had to show. 



The implication from (3) to (1) follows from Lemma 3.1.8 □ 



Lemma 9.3.37. Let f: V 2 — > V be a binary injection of behavior px/p2 which 
preserves E and N . Then f generates a binary injection of type rain and a binary 
injection of type max. 

Proof. By Theorem |9.3.8[ / generates a binary injection of type max, min, or 
P\. Suppose first that it does not generate a binary injection of type max or min; 
we will lead this to a contradiction. Let 03 be the structure with domain V that 
contains all relations that are first-order definable in (V; E) and that are preserved 
by /. Since / generates a binary injection of type p\, we may apply implication (2) 



—> (1) from Lemma 9.3.36 Let be a primitive positive formula with variable set 
S, {xi, . . . , 2*4} C S, such that the formulas <p A N(xx,x 2 ) A f\i<j<4 x i 7^ x j an( i 
<p A N(x3,X4) A Ai<j<4 x i 7^ x j have in 03 the satisfying assignments r and s from 
S —> V, respectively. 

We can assume without loss of generality that r(xx) < r(x 2 ) and r(x 3 ) < r(xi); 
otherwise, since r(xi), . . . , r(xx) must be pairwise distinct, we can apply an automor- 
phism of (V; E) to r such that the resulting map has the required property. Similarly, 
by applying an automorphism of (V; E) to s, we can assume without loss of general- 
ity that s(xi) < s(x 2 ) and s(x4) > 5(2:3). Then the mapping t: S — > V defined by 
t(x) = f(r(x),s(x)) shows that <p A N(xi,x 2 ) A N(x 3 ,X4) is satisfiable in 03: 

• The assignment t satisfies cp since / is a polymorphism of 03. 

• We have that N(t(xx),t(x 2 )) since r(xi) < r{x 2 ), s(xx) < s(x 2 ), / is of type 
Pi on input (<,<), and N(r(xi), r(x 2 )). 

• We have that N(t(x3), t(x4)) since r(x 3 ) < r{x^), 5(23) > s(x4), / is of type 
p 2 on input (<, >), and N(s(x3), s(xi)). 
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By Lemma |9.3.36[ we conclude that 03 is preserved by a binary injection of type min, 
and consequently / generates a binary injection of type min - a contradiction. 

Therefore, / generates a binary injection of type max or min. Since the assump- 
tions of the lemma are symmetric in E and N , we infer a posteriori that / generates 
both a binary injection of type max and a binary injection of type min. □ 

Having ruled out some behaviors without constants, we now examine behaviors 
when we add constants to the language. In the sequel, we will also say that a function 
/ : V 2 — > V has behavior B between two points x, y G V 2 if it has behavior B on {a;, y}. 

Lemma 9.3.38. Let u e V 2 , and set U := (V\ {id}) x (V\ {u 2 })- Letf:Y 2 ^¥ 
be a binary injection which preserves E and N , behaves like p\ between all points 
v, w G U, and which behaves like pi between u and all points in U . Then f generates 
a binary injection of type min as well as a binary injection of type max. 

PROOF. Let 03 be the structure with domain V that contains all relations that 
are first-order definable in (V; E) and that are preserved by /. Since U contains 
copies of products of arbitrary finite graphs, / behaves like p\ on arbitrarily large 
finite substructures of (Y;E) 2 , and hence generates a binary injection of type p\ by 



Lemma 8.3.13 Hence 03 is also preserved by such a function, and we may apply the 



implication from (2) to (1) in Lemma 9.3.36 to 03 



Let <j) be a primitive positive formula with variable set S, {x\, . . . , X4} C S, such 
that <j) A N(x 1 ,x 2 ) A /\i<i<j<4 x i ^ x j an d <t> A N(x 3 ,x 4 ) A Ai<i<j<4 x i ^ x j are 
satisfiable over 03, witnessed by satisfying assignments r, s: S — > V, respectively. 

Let a be an automorphism of (V; E) that maps r(x 3 ) to U\, and let j3 be an au- 
tomorphism of (W;E) that maps s(x3) to v,2- Then (a(r(xa)), f3{s(xs))) = u, and 
v := (a(r(x4,)),fi(s(x4,))) G U since a(r(xi)) ^ a(r(xs)) — «i and /3(s(x4)) 7^ 
P( s ( x 3)) = u 2- Thus, / behaves like pi between u and v, and since s satisfies 
N(x3, X4), we have that t: S — > V defined by 

t(x) = f(a(x),[3(x)) 

satisfies N(x3,X4), too. Since a,f3,f are polymorphisms of 03, the assignment t also 
satisfies 4>. To see that t also satisfies N(xi,X2), observe that a(r(xi)) 7^ a(r(x3)) 
and f3(s(xi)) 7^ [3(s(x 3 )) : and hence p := (a(r(xi)), (3(s(xi))) £ U. Similarly, q := 
(a(r(x2)), f3(s(x2))) ^ U. Hence, / behaves aspi betweenp and g, and since r satisfies 
N(xi,X2), so does t. 

By Lemma |9.3.36| we conclude that 03 is preserved by a binary injection of type 
min, and consequently / generates a binary injection of type min. 

Since our assumptions on / were symmetric in E and N, it follows that / also 
generates a binary injection of type max. □ 

Lemma 9.3.39. Let u G V 2 , and let f : V 2 — > V be a binary injection that behaves 
like p\ between all points v,w G U := (V\ {iti}) X (V\ {^2}), and which behaves like 
P2 between u and all points in U . Then f generates a binary injection of type min. 

Proof. The proof is identical with the proof in the preceding lemma; note that 
our assumptions on / here imply more deletions of edges as the assumptions in that 
lemma, so it can only be easier to generate a binary injection of type min. □ 

Lemma 9.3.40. Let u,v e V 2 such that ^^(u,v) and set W := (V\ {ui,«i}) X 
(V \ {u2,i>2}). Let f : V 2 — > V be a binary injection that 

• preserves E and N 

• behaves like p\ between all points w, r G W 

• behaves like p\ between u and all points w G W 
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behaves like pi between v and all points w £ W 
does not behave like p\ between u and v. 
Then f generates a binary injection of type min as wella s a binary injection of type 
max. 

Proof. We have to consider the case that EN(u, v) and N(f(u), /(f)), and the 
case that NE(u,v) and E(f(u), /(«))• In the first case we prove that / generates a 
binary injection of type min; it then follows by duality that in the second case, / 
generates a binary injection of type max. 



As in Lemma 9.3.38 we apply the implication (2) — > (1) from Lemma 9.3.36 Let 
03, <p, %x, . . . , X4, and S be as in the proof of Lemma |9.3.38| by the same argument as 
before, 03 is preserved by a binary injection of type p\. 

If N(r(x 3 ), r(x4)), then the assignment r shows that cj> A N(x±, X2) A N(x 3 ,X4,) is 
satisfiable and we are done. Otherwise, since r(x3) 7^ r(x4), we have E(r(x 3 ), r(x4)). 
Therefore, there is an a £ Aut((V;£')) such that (a(r(x 3 )),a(r(x4))) = (ui,vi). 
Similarly, since N(s(x 3 ), s(x4)) and N(u2,V2), there is a f3 <E Aut((V;i?)) such that 
(j3(s(x 3 )), /3(s(x4))) — (112,1)2)- We claim that the map t: S — > V defined by 

t(x) = f(a(x),(3(x)) 

is a satisfying assignment for <p A N(xi,X2) A N(x 3 ,X4,). The assignment t satisfies 
4> since a,f3 and / are polymorphisms of 03. Then N(t(x 3 ),t(x4)) holds because 
(a(r(x 3 )),f3(s(x 3 ))) = u and (a(r(a; 4 )), /3(s(x 4 ))) = v, and N(f(u),f(v)). To prove 
that N(t(xi),t(x2)) holds, observe that r(x\) ^ r(x%) and r(x\) ^ r(x4), and hence 

a(r(xi)) 4- {a(r(x 3 )),a(r(x4))} = {ui,vx} . 

Similarly, P(s(xi)) i {0(s(x 3 )), (3(s(x 4 ))} = {u 2 ,v 2 }. Hence, (a(r(x 1 )), P{s(xi)) G 
W. A similar argument for X2 in place of Xi shows that (a(r(x2)), (3(s(x2)) € W. 
Since / behaves like p\ between all points of W, and since r satisfies N(xx,X2), we 
have proved the claim. This shows that 03 is preserved by a binary injection of type 
min, and hence / generates such a function. 

By symmetry of our assumptions on / in E and A, it follows that / generates 
a binary injection of type min if and only if it generates a binary injection of type 
max. □ 

We are now set up to prove Proposition |9.3.33[ and hence complete the proof of 
Proposition |9.3.30| 

Proof of Proposition 19. 3.331 Let / be given. By Theorem |9.3.8[ / generates 
a binary canonical injection g of type projection, min, or max. In the last two cases we 
are done, so consider the first case. We claim that / also generates a (not necessarily 
canonical) binary injection h of type min or max. Then h(g(x,y), g(y,x)) is still of 
type min or max and in addition canonical, and the proposition follows. 

To prove our claim, fix a finite set C := {c%, . . . , c m } C V such that the fact that 
/ does not behave like a projection is witnessed on C. Invoking Theorem |8. 3. 11[ wc 
may henceforth assume that / is canonical as a function from (V; E, <,c\, . . . , c m ) 2 
to (V; E, <) (and hence also to (V; E) since tuples of equal type in (V; E, <) have 
equal type in (V; E)). It is clear that this new / must be injective. 

In the following we consider orbits of elements in the structure (V; E, < , C\ , . . . , c m ) . 
The infinite orbits are precisely the sets of the form 

{v € V I Qi(v,Ci) and Ri(v,Ci) for all 1 < i < m}, 

for Qx, . . . , Q m € {E, N}, and Ri, . . . , R m € {<, >}. The finite orbits are of the form 
{cj} for some 1 < i < m. Each infinite orbit of (V; E, <, c\, . . . , c m ) contains copies of 
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arbitrary linearly ordered finite graphs, and in particular, forgetting about the order, 
of all finite graphs. 

Therefore, if / behaves like min or max on an infinite orbit of (V; E, <, Ci, . . . , c m ), 
then by Lemma |8. 3. 13 it generates a function which behaves like min or max every- 
where, and we are done. Moreover, if / is of mixed type on an infinite orbit, then, 
again by Lemma |8.3.13[ / generates a canonical function which has the same mixed 
behavior everywhere. But then we are done by Lemmas |9.3.34[ |9.3.35| and |9.3.37| 
Hence, we may henceforth assume that / behaves like a projection on every infinite 
orbit. Fix in the following an infinite orbit O and assume without loss of generality 
that / behaves like p\ on O. 

Now suppose that there exists an infinite orbit W such that / behaves like p 2 
between all points u £ O 2 and v £ W 2 for which u\ < V\ and u 2 < v 2 . Then fix 
any v € W 2 , and set Oi := {o £ O \ o < vi} and 2 := {o £ O \ o < v 2 }. Set 
0[ := 0\ U {v{\ and 2 ■= 2 U {^2}- We then have that / behaves like p 2 between 
v and any point u of (0[ \ {v±}) x (0' 2 \ {^2}), and like p\ between any two points of 
(0[ \ {v\}) x {0 2 \ {^2})- Since (O'^E, Vi) contains copies of all finite substructures of 
(Y;E,Vi), for i £ {1,2}, by Lemma 8.3. 13| we get that / generates a function which 
behaves like p 2 between v and any point u of (V\{«i}) x (V\{v2})j an d which behaves 
like pi between any two points of (V\ {vi}) x (V\ {^2})- Then Lemma 9.3.38 implies 
that / generates a binary injection of type min and we are done. 

This argument is easily adapted to any situation where there exists an infinite 
orbit W such that / behaves like p 2 between all points u £ O 2 and v £ W 2 with 
R\{u x ,V\) and R 2 {u 2 ,v 2 ), for Ri,R 2 £ {<,>}. 

When there exists an infinite orbit W such that / behaves like min between 
all points u £ O 2 and v £ W 2 with R\{u\,v{) and R 2 {u 2 ,v 2 ), then we can argue 
similarly, invoking Lemma 9.3.38 at the end. Replacing min by max we can use the 
dual argument, with the notable difference that / generates a binary injection of type 
max rather than min. 

Since / is canonical, one of the situations described so far must occur. Putting 
this together, we conclude that for every infinite orbit W and all points u £ O 2 and 
v £ W 2 , f behaves like p\ between u and v. Having that, suppose that for an infinite 
orbit W, f behaves like p 2 on W. Then exchanging the roles of O and W and of p\ and 
p 2 above, we can again conclude that / generates a binary injection of type min. We 
may thus henceforth assume that / behaves like p\ between all points u, v £ (V\ C) 2 . 

Pick any u £ C 2 . Suppose that there exists v £ (V \ C) 2 such that / does 
not behave like p\ between u and v; say without loss of generality that EN(u,v) 
and N(f(u),f(v)). Let Oi be the (infinite) orbit of Uj, for i £ {1,2}. Then for all 
v £ 0\ x 2 we have EN(u,v) and N(f(u),f(v)) since / is canonical. Now let 
w £ 2 x 0\. We distinguish the two cases E(f(u), f(w)) and N (f (u) , f (w)) . In the 
first case, / behaves like p 2 between u and all v £ (Oi U 2 ) 2 . We can then argue 
as above and are done. In the second case, / behaves like min between u and all 
v £ (Oi U 2 ) 2 , and we are again done by the corresponding argument above. We 
conclude that we may assume that for all u £ C 2 and all v £ (V\ C) 2 , / behaves like 
Pi between u and v as well. 

Now pick u, v £ C 2 such that / does not behave like p\ between u and v, say 
without loss of generality EN(u,v) and N(f(u),f(v)); this is possible since the fact 
that / does not behave like p\ everywhere is witnessed on C. Pick any 16 infinite 
orbits 0\, . . . , 0\q such that for all Qi,Q 2 ,Ri,R 2 £ {E, N} there exists w £ {0\ U 
• • - U Oi 6 ) 2 with QxQ 2 (u,w) and RxR 2 {v, w). Set Si := {tt 1 ,« 1 }U0 1 U---U(9 16 and 
S2 :== {u 2 , v 2 } U 0\ U ■ • • U Oi6- Then Sj contains copies of all finite substructures 



of (V; E, Ui, Vi), for i £ {1,2}, and hence applying Lemma 8.3.13 to functions from 
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(V; E, Ux, Vi)x (V; E, U2, ^2) to (V; E) we see that / generates a function which behaves 
like P2 between u and v, like pi between u and all points w € (V \ {u\, v\}) X (V \ 
{^2,^2}), like pi between w and all points w £ (V\{ui,Vi}) x (V \ {-«2, w 2 }), and like 
Pi between any two points w,r € (V\ {tti, t>i}) x (V\ {it2, ^2})- But then we are done 



by Lemma [9.3.401 □ 

9.4. Algorithms for Graph-SAT problems 

Throughout this section we assume that 03 is a structure with finite relational 
signature r and a first-order definition in (V; E) . 

9.4.1. The unbalanced case. We now prove tractability of the CSP for tem- 



plates 03 as in cases (b) and (c) of Proposition 9.3.4 that is, for structures with a 
first-order definition in (V; E) that have 

• a ternary polymorphism of type majority or minority, and 

• a binary polymorphism of type pi which is either _E-dominated or iV-dominated 
in the second argument. 

By duality, we may assume that the polymorphism of type p\ is iS-dominated in the 
second argument. 

It turns out that for such templates 03 we can reduce CSP(03) to the CSP of the 
a structure that we call the injectivization of 03. This implies in turn that the CSP 
can be reduced to a CSP over a Boolean domain. 

Definition 9.4.1. A tuple is called injective if all its entries have pairwise dis- 
tinct entries. A relation is called injective if all its tuples are injective. A structure 
is called injective if all its relations are injective. 

Definition 9.4.2. We define injectivizations for relations, atomic formulas, and 
structures. 

• Let R be any relation. Then the injectivization of R, denoted by inj(i?), is 
the (injective) relation consisting of all injective tuples of R. 

• Let 4>(xi, . . . , x n ) be an atomic formula in the language o/03, where x%, . . . , x n 
is a list of the variables that appear in <j). Then the injectivization of 
4>(xi, . . . , x n ) is the formula R™ 3 {x\, . . . , x n ), where R^ is a relation sym- 
bol which stands for the injectivization of the relation defined by <f>. 

• The injectivization of a relational structure 03, denoted by inj(03), is the 
relational structure € with the same domain as 03 whose relations are the 
injectivizations of the atomic formulas over^B, i.e., the relations R, . 

Note that inj(03) also contains the injectivizations of relations that are defined 
by atomic formulas in which one variable might appear several times. In particular, 
the injectivization of an atomic formula <f> might have smaller arity than the relation 
symbol that appears in <f>. 

To state the reduction to the CSP of an injectivization, we also need the following 
operations on instances of CSP(03). Here, it will be convenient to view instances of 



CSP(03) as primitive positive r-sentences (see Section 1.2 1 



Definition 9.4.3. Let <j> be an instance o/CSP(03). Then the injectivization of 
4>, denoted by inj(</>), is the instance ip of CSP(mj(03)) obtained from <f> by replacing 
each conjunct <p(x±, . . . ,x n ) of (p by R 1 ? 3 ' (x\, . . . ,x n ). 



We say that a constraint in an instance of CSP(03) is false if it defines an empty 
relation in 03. Note that a constraint R{x\, . . . , Xk) might be false even if the relation 
R is non-empty (simply because some of the variables from x±, . . . , Xk might be equal). 
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// Input: An instance <f> of CSP(QS) with variables W 

While $ contains a constraint <j> that implies x — y for x,y £ W do 

Replace each occurrence of x by y in <f>. 

If <f> contains a false constraint then reject 
Loop 

Accept if and only if inj(0) is satisfiable in inj(Q5). 



FIGURE 9.2. Algorithm for CSP(03) when 03 is preserved by an un- 
balanced binary injection, using an algorithm for inj(03). 



Proposition 9.4.4. Let 03 be preserved by a binary injection f of type p\ that is 



E -dominated in the second argument. Then the algorithm shown in Figure 9.2 is a 
polynomial-time reduction o/CSP(03) to CSP(mj(03)). 

Proof. In the main loop, when the algorithm detects a constraint that is false 
and therefore rejects, then cf> cannot hold in 03, because the algorithm only contracts 
variables x and y when x = y in all solutions to <fi - and contractions are the only 
modifications performed on the input formula <p. So suppose that the algorithm does 
not reject, and let rp be the instance of CSP(03) computed by the algorithm when it 
reaches the final line of the algorithm. 

By the observation we just made it suffices to show that ip holds in 03 if and only 
if inj(-0) holds in inj(03). It is clear that when inj(?/>) holds in inj(03) then ijj holds in 
03 (since the constraints in mj(^/>) have been made stronger). We now prove that if tp 
has a solution s in 03, then there is also a solution for inj(-0) in inj(03). 

Let s' be any mapping from the variable set W of ip to V such that for all distinct 
x, y € W we have that 

• if E(s(x),s(y)) then E{s'(x),s'(y)); 
. if N(s(x),s(y)) thenN(s'(x),s'(y)); 

• if s(x) = s(y) then E(s'(x), s'(y)). 

By universality of (V; E), such a mapping exists. We claim that s' is a solution to ip 
in 03. Since s' is injective, it is then clearly also a solution to inj(-0). To prove the 
claim, let 7 be a constraint of ip on the variables x\, . . . ,Xk € W. Since we are at 
the final stage of the algorithm, we can conclude that 7(^1, . . . ,Xk) does not imply 
equality of any of the variables Xi, . . . , and so there is for all 1 < i < j < k a tuple 
v- x 'i' such that R{t^') and ti ^ tj hold. Since 7(^1, . . . , Xk) is preserved by a binary 
injection, it is also preserved by injections of arbitrary arity (it is straightforward to 
build such terms from a binary injection). Application of an injection of arity to 
the tuples shows that 7(2:1, . . . , Xk) is satisfied by an injective tuple (ti, . . . , tfc). 

Consider the mapping r: {x\, . . . ,Xk} — > V given by r(xi) := f(s(xi),ti). This 
assignment has the property that for all i,j £ S if E(s(xi), s(xj)), then E{r{x), r(y)), 
and if N(s(xi),s(xj)) then N(r(xi), r(xj)), because / is of type p\. Moreover, if 
s(xi) = s(xj) then E(r(xi),r(xj)) because / is ^dominated in the second argument. 
Therefore, (s'(xi), . . . , s' (x n )) and (r(xi), . . . , r[x n )) have the same type in (Y;E). 
Since / is a polymorphism of 03, we have that (r(xi), . . . , r(x n )) satisfies the constraint 
7(0:1, . . . , x n ). Hence, s' satisfies 7(3:1, . . . , x n ) as well. We conclude that s' satisfies 
all the constraints of tp, proving our claim. □ 

To reduce the CSP for injective structures to Boolean CSPs, we make the following 
definition. 

Definition 9.4.5. Let t be a k-tuple of distinct vertices of (Y;E), and let q be 
(2) . Then Boole(t) is the q-tuple (01,2, a.1,3, ■ ■ ■ , »i,fc, 02,3, . . . , flfe— € {0, l} q such 
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that ciij = if N(ti,tj) and <2jj = 1 if E(ti,tj). If R is a k-ary injective relation, 
then Boole(i?) is the q-ary Boolean relation {Boolc(i) | t G R}. If (j) is a formula 
that defines an injective relation R over (V;E), then we also write Boole(</>) instead 
of Boolc(inj(i?)). Finally, for injective 05, we write Boole(05) for the structure over 
a Boolean domain with the relation Boole(i?) for each relation Rof*S. 

Proposition 9.4.6. Let*B be injective. Then there is a polynomial-time reduction 
from CSP(Q5) to CSP (Boole (05)). 

PROOF. Let <f> be an instance of CSP(*8) with variable set W . We create an 
instance ip of CSP(Boole(Q5)) as follows. The variable set of -0 is the set of unordered 
pairs of variables from <f>. When 7 = R(x±, . . . , Xk) is a constraint in $, then ^ contains 
the constraint Boole(i?)(a; 1 2 , £1,3, . . . , £i,fe, £2, 3, . . . , Xk-i,k)- It is straightforward to 
verify that \& can be computed from 4> in polynomial time, and that $ is a satisfiable 
instance of CSP(Q5) if and only if "J is a satisfiable instance of CSP(Boole(*35)). □ 



The following proposition, together with Propositions 9.4.4| and |9.4.6[ solves the 
case where Pol(Q5) contains a ternary injection of type minority or majority as well as 
one of the functions of Theorem |9.3.8| which are unbalanced and of type projection. 



It thus shows tractability of cases (b) and (c) of Proposition 9.3.4 given that none of 
the other cases applies. 

Proposition 9.4.7. Let 05 be injective, and suppose it has an polymorphism of 
type minority (majority). Then Boole(95) has a minority (majority) polymorphism, 
and CSP(Boole(Q5)) can be solved in polynomial time. 

Proof. It is straightforward to show that Boole(OS) has a minority (majority) 
polymorphism. We have seen in Theorem 5.4. 3| that CSP(Boole(Q5)) can then be 



solved in polynomial time. □ 

9.4.2. Tractability for type minority. We show tractability of CSP(Q5) when 
55 has a polymorphism of type minority as well as a binary canonical injection of type 
pi which is balanced. We start by proving that in this case the relations of 55 can 
be defined in (V; E) by first-order formulas of a certain restricted syntactic form; this 
normal form will later be essential for our algorithm. 

Recall that a Boolean relation R is affine if it can be defined by a conjunction 
of linear equations modulo 2, which is the case if and only if R is preserved by the 



Boolean minority operation (see Theorem 5.4.31. In the following, we denote the 
Boolean exclusive-or connective (xor) by ®. 

Definition 9.4.8. A graph formula is called edge affine if it is a conjunction of 
formulas of the form 

xi ^ yi V ... V x k ^y k 

V (tii 7^ i>\ A • • • A ui 7^ vi 

AE(ui,v 1 )®---®E(u h v l ) =p) 

V (ui = vi A • • • A ui = Vi) , 

where p € {0, 1}, variables need not be distinct, and each of k and I can be 0. 

Definition 9.4.9. A ternary operation f: V 3 — > V is called straight if for every 
c G V, the binary operations (x,y) 1— >• f(x,y,c), (x,z) 1— > f(x,c,z), and (y,z) t— > 
f{c,y,z) are balanced injections of type pi. 

Observe that the existence of straight operations and even straight minority op- 
erations follows from the fact that (V; E) contains all countable graphs as induced 
subgraphs. 
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Proposition 9.4.10. Let R be a relation with a first-order definition over (Y;E). 
Then the following are equivalent: 

(1) R can be defined by an edge affine formula; 

(2) R is preserved by every injection of type minority which is straight; 

(3) R is preserved by an injection of type minority, and a balanced binary injec- 



Proof. We first show the implication from 1 to 2, that every n-ary relation R 
defined by an edge affine formula ip(xi, . . . , x n ) is preserved by straight functions / of 
type minority. We verify that each clause </> from if) is preserved by /. By injectivity 
of /, it is easy to see that we only have to show this for the case that </> does not 
contain disequality disjuncts (i.e., for the case k = 0). In this case <f> is of the following 
form, for p £ {0, 1} and m, . . . ,u h vi, . . . ,Vi £ {x\, . . . ,£„}. 



In the following, it will sometimes be notationally convenient to consider tuples in 
(V; E) satisfying a formula as mappings from the variable set of the formula to V. Let 
ti>*2>*3 : {xi, ■ ■ ■ , x n } -> V be three mappings that satisfy cj>. We have to show that 
the mapping to : {xi, . . . , x n } — > V defined by to(x) — f (t\(x) ,t2{x) ^^(x)) satisfies <j>. 

Suppose first that each of ti,t 2 ,t 3 satisfies u\ ^ V\ A • • • A ui ^ In this case, 
h(ui) ^ t (vi)A- ■■ At (ui) ^ t (vi), since / preserves ^. Note that E(t {ui), io(«i)), 
for 1 < i < I, if and only if £(ti«), h(vi)) 8 E(t 2 (ui), t 2 K)) 6 J5(i 3 («i), t 3 (vi)) = 1. 
Therefore, since each t\, t%, t% satisfies E{u\, v{) © • • • © E(ui,vi) — p, we find that to 
also satisfies E(m,vi) © • • • © E(m,vi) = p®p®p = p. 

Next, suppose that one of ^1,^2,^3 satisfies Ui = Vi for some (and therefore for all) 
1 < i < I. By permuting arguments of /, we can assume that t\{ui) = ti(vi) for all 
i £ {1, . . . ,1}. Since the function / is straight, the operation g: (y, z) i-> f(t\{ui), y, z) 
is a balanced injection of type p\. Suppose that ti(ui) ~t 2 (vi). Then E(to{ui), to{vi)) 
if and only if E(t 3 (ui), ts(vi)), since g is balanced. Hence, to satisfies <j). Now suppose 
that tziui) ^ t2(vf). Then E{to(ui),to(vi)) if and only if E(t2(ui),t2(vi)), since g is 
of type p\. Again, to satisfies <f>. This shows that / preserves <f>. 

The implication from 2 to 3 is trivial, since every straight function of type mi- 
nority generates a balanced binary injection of type p\ by identification of two of its 
variables. It is also here that we have to check the existence of straight injections of 
type minority; as mentioned above, this follows easily from the universality of (V; E). 

We show the implication from 3 to 1 by induction on the arity n of the relation R. 
Let g be the balanced binary injection of type pi, and let h be the operation of type 
minority. For n = 2 the statement of the theorem holds, because all binary relations 
with a first-order definition in (V; E) can be defined over (V; E) by expressions as in 
Definition EES 

• For x 7^ y we set k = 1 and / = 0. 

• For ~^E(x,y) we can set k = 0, / = l,p = 0. 

• For -*N(x, y) we can set k = 0, 1 = l,p = 1. 

• Then, E(x,y) can be expressed as (x ^ y) A ~^N{x 1 y). 

• N(x, y) can be expressed as {x 7^ y) A ~^E{x, y). 

• x = y can be expressed as ^E(x, y) A ^N(x, y). 

• The empty relation can be expressed as E(x, y) A N(x, y). 

• Finally, V 2 can be defined by the empty conjunction. 



tion of type p\ . 




A (E{ Ul , vi) © • ■ ■ © E(u h vi) = p)) 
V (ui = Vi A • • • A ui = v{) 
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For n > 2, we construct a formula 4> that defines the relation R{x\, . . . , x„) as 
follows. If there are distinct i,j £ {l,...,7i} such that for all tuples t in R we 
have ti = tj, consider the relation defined by 3xi.R(xi, . . . ,x n ). This relation is also 
preserved by g and h, and by inductive assumption has a definition ip as required. 
Then the formula <f) := (xi — x,j) A tp proves the claim. So let us assume that for all 
distinct i, j there is a tuple t £ R where ti ^ tj. Note that since R is preserved by 
the binary injective operation g, this implies that R also contains an injective tuple. 

Since R is preserved by an operation of type minority, the relation Boole(inj(i?)) 
is preserved by the Boolean minority operation, and hence has a definition by a 



conjunction of linear equations modulo 2 (Theorem 5.4.3). From this definition it is 
straightforward to obtain a definition ip(xi, . . . , x n ) of inj(i?) which is the conjunction 
°f Ai<j<n x i 7^ x j an d °f formulas of the form 

E{u ll v l )®---®E(ui,v l ) = p, 

for Ut,...,ui,vi,...,vi € {x\ , . . . , x n }. It is clear that we can assume that none of the 
formulas of the form E{u\, v{) © • • • © E(ui, vi) = p in -0 can be equivalently replaced 
by a conjunction of shorter formulas of this form. 

For all i, j £ {1, . . . , n} with i < j, let Rij be the relation that holds for the tuple 
(x%, . . . , Xi+t, . . . ,x n ) iff R(xi, . . . , Xj_i, Xj, Xi + i, ... ,x n ) holds. Because Rij is 
preserved by g and h, but has arity n — 1, it has a definition i/;^.,- as in the statement 
by inductive assumption. We call the conjuncts of ijiij also the clauses of tpij. 

Let <f) be the conjunction composed of conjuncts from the following two groups: 

(1) 7 V (xi Xj) for all i < j < n and each clause 7 of V'iji 

(2) when r\ = (E(u%, v\) © • • • ®E{ui, v{) — p) is a conjunct of ip, then <f> contains 
the formula 

(til ^ Ui A ■ • • A ui ^ vi A 77) 
V (m = vi A • • • Aui = Vi) . 

Obviously, <p is a formula of the required form. We have to verify that cj) defines R. 

Let t be an n-tuple such that t (£ R. If t is injective, then t violates a formula of 
the form 

E{u 1 ,v 1 )®---®E(u i ,Vi) =p 

from the formula "0 defining inj(i?), and hence it violates a conjunct of <fi of the second 
group. If there are i, j such that ti = tj then the tuple t l := [t\, . . . , ti_i,tf+i, . . . , t n ) £ 
Rij- Therefore some conjunct 7 of ^ j is not satisfied by t % , and 7 V (a;, 7^ Xj) is not 
satisfied by t. Thus, in this case t does not satisfy 4> either. 

It remains to verify that all t £ R satisfy 4>. Let 7 V [X4 7^ Xj) be a conjunct of <f> 
created from some clause in ipi,j- If U 7^ tj, then t satisfies x, 7^ Xj. If U = tj, then 
(ti, . . . , U-i, fj+i, . . . ,t n ) € -Ri.j and thus this tuple satisfies tpij. This also implies 
that t satisfies 4>. Now, let rj be a conjunct of from the second group. We distinguish 
three cases. 

(1) For all 1 < i < I we have that t satisfies Uj = Uj. In this case we are clearly 
done since i satisfies the second disjunct of rj. 

(2) For all 1 < i < Z we have that t satisfies u, 7^ Uj. Suppose for contradiction 
that t does not satisfy E(ui, Vi) © • • -®E{ui,vi) = p. Let r £ Rbe injective, 
and consider the tuple s := g(t, r). Then s £ R, and s is injective since the 
tuple r and the function g are injective. However, since g is of type pi, we 
have E(s(ui),s(vi)) if and only if E(t(ui),t(vi j), for all 1 < i < I. Hence, s 
violates the conjunct E(u\,vi) © • • • © E{ui,v{) — p from ip, a contradiction 
since s £ inj(i?). 
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(3) The remaining case is that there is a proper non-empty subset S of {1, . . . , 1} 
such that t satisfies Ui — Vi for all i € S and t satisfies Ui ^ Vi for all 
i € {1, . . . ,n} \ S. We claim that this case cannot occur. Suppose that all 
tuples t' from inj(i?) satisfy that ®j £S E{m,Vi) = d for some d € {0, 1}. In 
this case we could have replaced E(u\, v{) © • • • © E{ui,v{) = p by the two 
shorter formulas iGS v{) = d and ie {i i ...,„}\5 E(m, Vi) = pffi d, in 
contradiction to our assumption on ip. So, for each d € {0, 1} there is a tuple 
s rf e inj(i?) where (§} ieS E(ui,Vi) = d (and thus (Bie{i,..., n }\S E ( u i' v i) 
j> © <f). Now, for the tuple g(t, si_ p ) we have 

E(ui,Vi) = ©^(tii.Wi)© E(u i: Vi) 

ie[n] ieS i£[n]\S 

= P®(p®(l-p)) 

= i-p¥ z p 

which is a contradiction since <7(t, si_ p ) € inj(i?). 
Hence, all t € i? satisfy all conjuncts of We conclude that </> defines i?. □ 

We now present a polynomial-time algorithm for CSP(Q3) for the case that 05 has 
finitely relations that are all edge affinc. 

Definition 9.4.11. Suppose all relations of 25 are edge affine, and let <j) be an 
instance o/CSP(Q5). Then the graph of <p is the (undirected) graph whose vertices are 
unordered pairs of distinct variables of <fi, and which has an edge between distinct sets 
{a, b} and{c,d} if <f> contains a constraint whose definition as in Definition 9.4-8 has 
a conjunct of the form 

(it! ^ vi A • • ■ A ui ^ vi A (E^uvt) © • • • © E(u u v{) = p)) 

V (ui = Vi A • • • A ui = v{) 

such that {a, b} — Vi} and {c, d} — {uj, Vj} for some i,j G {1, . . . , I}. 

It is clear that for 55 with finite signature, the graph of an instance 4> of CSP(Q5) 
can be computed in linear time from (j). 

Definition 9.4.12. Let 05 only have edge affine relations, and let cj) be an instance 
o/CSP(Q5). For a set C of 2- element subsets of variables of (j), we define inj($,C) to 
be the following affine Boolean formula. The set of variables o/inj(0, C) is C. The 
constraints o/inj(</>, C) are obtained from the constraints 7 of (j) as follows. If 7 has 
a definition as in Definition \9.4-8 with a clause of the form 

(ui ^ vi A • • • A ui ^ vi A (E(ui,Vi) © • • • © E{u u vi) = p)) 

V (ui = vi A • • • A ui = vi) 

where all pairs v{\ are in C , then inj(</>, C) contains the conjunct {u\, v%} © • • • © 
{u u vi} = p. 



Tractability of case (d) of Proposition 9.3.4 now follows from the following propo- 
sition and Proposition |9.4.10| 

Proposition 9.4.13. Let 05 be a structure with a first-order definition in (V; E) 
and a finite signature, and suppose that 25 is preserved by a straight injection of type 
minority. Then the algorithm shown in Figure 9.3 solves CSP(05) in polynomial time. 

Proof. We first show that when the algorithm detects a constraint that is false 
and therefore rejects in the innermost loop, then <f> must be unsatisfiable. Since 
variable contractions are the only modifications performed on the input formula </>, it 
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// Input: An instance 4> of CSP(5B) with variables V 
Repeat 

For each connected component C of the graph of <f> do 
Let ip be the affine Boolean formula inj (</!>, C). 
If ijj is unsatisfiable then 
For each {x, y} £ C do 

Replace each occurrence of x by y in (f>. 
If <j> contains a false constraint then reject 
Loop 

Until inj ((/>, C) is satisfiable for all components C 
Accept 



FIGURE 9.3. A polynomial-time algorithm for CSP(03) when 03 is 
preserved by a straight operation of type minority. 



suffices to show that the algorithm only equates variables x and y when x — y in all 
solutions to (f>. To see that this is true, assume that ip := inj(0, C) is an unsatisfiable 
Boolean formula for some connected component C . Hence, in any solution s to 4> 
there must be a pair {x,y\ in C such that s[x) — s(y). It follows immediately from 
the definition of the graph of <fi that then s(u) = s(v) for all {u, v} adjacent to {x, y} 
in the graph of <f>. By connectivity of C, we have that s(u) = s(v) for all {u,v} G C. 
Since this holds for any solution to (j), the contractions in the innermost loop of the 
algorithm preserve satisfiability. 

So we only have to show that when the algorithm accepts, there is indeed a 
solution to 4>. When the algorithm accepts, we must have that inj(^, C) has a solution 
sc for all components C of the graph of <p. Let s be a mapping from the variables 
of (p to the V such that E(xi, Xj) if {xi, Xj} is in component C of the graph of cj> and 
Sc{{xi,Xj}) = I, and N{xi,Xj) otherwise. It is straightforward to verify that this 
assignment satisfies all of the constraints. □ 



9.4.3. Tractability for type majority. We turn to case (e) of Proposition 9.3.4 
i.e., the case where 03 has ternary injection of type majority and a binary canonical 
injection of type p\ which is balanced. 

Recall that a Boolean relation is called bijunctive if it can be defined by a con- 
junction of clauses of size at most two. It is well-known that a Boolean relation 
is bijunctive if and only if it is preserved by the Boolean majority operation (see 



Section 5.4). 



Definition 9.4.14. A relation R on (Y:E) is called graph bijunctive if it can be 
defined in (Y;E) by a conjunction of disjunctions of dis equalities, and of formulas of 
the form 

Xi^yiW ... V x k ^y k 

V (ui ^v 1 Au 2 ^v 2 A(X(u 1 ,v 1 )\/Y(u2,v 2 ))) 

V (ui = v\ A u 2 = v 2 ) , 

where X,Y G {E,N}, variables need not be distinct, and k can be 0. 

Proposition 9.4.15. Let R be a relation with a first-order definition in (Y;E). 
Then the following are equivalent. 

(1) R is graph bijunctive; 

(2) R is preserved by every straight function of type majority; 
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(3) R is preserved by a function of type majority and a balanced injection of type 
Pi- 



PROOF. The proof is very similar to the proof of Proposition |9.4.10| We first show 
the implication from 1 to 2, that relations that are graph bijunctive are preserved by 
straight functions / of type majority. By injectivity of /, it suffices to show this for 
the case that the formulas do not contain disequality disjuncts (i.e., k = 0). Since 
the clauses 4> of such a formula are such that Boole(0) is bijunctive, the claim follows 
from the fact that bijunctive Boolean relations are preserved by the Boolean majority 
operation in very much the same way as in Proposition |9.4.10| 

For the implication from 2 to 3, observe that straight functions of type majority 
exist since (V; E) is universal, and that identifying two variables of such an operation 
yields a balanced injection of type p\. 

We show the implication from 3 to 1 by induction on the arity n of the relation 
R. Let (V; E) be the balanced binary injection of type pi, and let h be the operation 
of type majority. For n = 2 the statement of the proposition holds because all binary 



relations with a first-order definition over (V; E) can be defined as in Definition 9.4.14 

• for ^E(x,y) we can set k = 0, X = Y := N, u\ = v\ := x, u-i — t>2 := y; 
dually, —<N(x, y) can be defined; 

• For x ^ y, this is trivial; 

• E(x,y) can be defined as the conjunct of x ^ y and ^N(x,y); dually, we 
can define N(x,y); 

• The relation x — y can be obtained as the conjunction of ->E(x,y) and 
~-N(x,y); 

• The empty relation is obtained as the conjunction of E(x,y) and N(x,y)\ 

• Finally, V 2 can be defined by the empty conjunction. 

For n > 2, we construct the formula c/> that defines the relation R(x±, . . . ,x n ) 
as follows. If there are distinct i,j E {1, . . . ,n) such that for all tuples t in R we 
have ti = tj, consider the relation defined by 3xi.R(xi, . . . ,x n ). This relation is also 
preserved by (V; E) and h, and by inductive assumption has a definition ^ as required. 
Then the formula <f) := (xi = Xj) A tp proves the claim. So let us assume that for all 
distinct i,j there is a tuple t E R where ti ^ tj. Note that since R is preserved by 
the binary injective operation g, this implies that R also contains an injective tuple. 

Since R is preserved by a function of type majority, the relation Boole(inj(i?.)) 
is preserved by the Boolean majority operation, and hence has a definition by a 
2SAT formula (Theorem 5.4. 3| ) . From this definition it is straightforward to obtain a 
definition ip(xi, . . . , x n ) of inj(i?) which is the conjunction of Ai<i<j<« x i ^ x j an< ^ 
of formulas of the form E(u, v), N(u, v), or 

X(ui,vi) V Y(ui,Vi) , 

for ui, U2, vi,V2 E {x\, . . . , x n }, and X, Y E {E, N}. We can assume (by removing 
successively literals from clauses) that this formula is reduced, i.e., that each of the 
conjuncts is such that removing any of its literals results in an inequivalent formula. 

For all i,j E {1, . . . ,n} with i < j, let Ri.j be the relation that holds for the 
tuple (xi, . . . , Xi—i, afi+i, . . . , a; n ) iff R(xi,...,Xi-i,Xj,Xi+i,...,x n ) holds. Because 
also Ri j is preserved by g and h, but has arity n — 1, it has a definition ip^j as in the 
statement by inductive assumption. We call the conjuncts of ip+ j also the clauses of 

Let <f> be the conjunction composed of conjuncts from the following two groups: 
(1) 7 V (xi 7^ Xj) for all clauses 7 of ipij and for all i < j < n; 
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(2) when n = (X(u±,Vi) V Y(u2,V2)) is a conjunct of ip, then <fi contains the 
formula 

(rj A ui ^ vi A u 2 7^ v 2 ) V (ui = i>i A ti 2 = v 2 ) ■ 

Obviously, (/> is a formula in the required form. We have to verify that (j) defines R. 

Let t be an n-tuple such that t ^ R. If t is injective, then since t ^ inj(i?), 
it violates a clause of the form X(u\,V\) V Y(ui,vi) of ip, and hence it violates 
the corresponding clause in <f>. If there are i,j such that tj — tj then the tuple 
t % := (ti, . . . , ti-i,U+ii ■ ■ ■ )*n) ^ -Rij - Therefore some clause 7 of is not satisfied 
by t l , and 7 V ^ is not satisfied by i. Thus, in this case t does not satisfy (j) 
either. 

It remains to be verified that all t € R satisfy (/). Let 7 V ^ Xj be a conjunct of 
4> created from some clause 7 of ipij. If U ^ tj, then £ satisfies Xi ^ Xj. If ti — tj, then 
(ti, . . . , U-i, ti+i, . . . , t n ) e Rij and thus this tuple satisfies ipij. Hence, t satisfies 7. 
Now, let 77 be a conjunct of cj) from the second group, so it is of the form 

V = ("1 ^ vi A u 2 ^ v 2 A (X(ut,V!) V Y(u 2 ,v 2 ))) 
V (ui — V\ A u 2 — v 2 ) . 

We distinguish three cases. 

(1) The tuple t satisfies both u\ = V\ and u 2 — v 2 . In this case we are clearly 
done since t satisfies the second disjunct of 77. 

(2) The tuple t satisfies u\ ^ v 2 and u 2 ^ v 2 . Then the argument is exactly the 
same as the argument in the proof of Proposition 9.4. 10| 



(3) The remaining case is that t satisfies u\ = v\ and u 2 7^ v 2 (or u\ 7^ v% 
and u 2 = v 2 , but the proof there is symmetric). We claim that this case 
cannot occur. If t satisfies Y(u 2 ,v 2 ), we are done; so let us assume that t 
satisfies ~^Y(u 2 , v 2 ). Since we assumed that ip is reduced, there exists a tuple 
s e inj(i?) (and hence in R) such that -^X(u\,u\) and Y(ui,v{); otherwise, 
we could have replaced the clause X(u\,v\) V Y(u 2 ,v 2 ) by X(u\,vi), and 
obtain a contradiction with the assumption that ip is reduced. The tuple 
r := g(t,s) is injective, and satisfies —Y (u 2 ,u 2 ) (since g is of type pi) and 
it also satisfies ^X(u\,vi) (since g is balanced). Since g is injective, we 
have found a tuple r € inj(i?) that does not satisfy X(ui,v\) V Y(ui,v\), a 
contradiction. 

□ 



Combining the following proposition with Proposition 9.4.15|finishes the proof of 



polynomial-time tractability of case (e) of Proposition 9.3.4 



Proposition 9.4.16. Let 58 be a structure with a first-order definition in (V; E) 
and a finite relational signature, and suppose that 03 is preserved by a straight junction 
of type majority. Then CSP(Q3) can be solved in polynomial time. 

Proof. The algorithm for CSP(03) is a straightforward adaptation of the proce- 



dure given in Figure 9.3 with the difference that instead of affinc Boolean equation 



systems we have to solve 2SAT instances in the inner loop. □ 

9.4.4. Tractability of types max and min. We are left with the case where 03 
has a canonical binary injective polymorphism of type max or min, which corresponds 



to case (f) of Proposition 9.3.4 



We claim that we can assume that this polymorphism is either balanced, or of 
type max and i?-dominated, or of type min and iV-dominated. 
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Proposition 9.4.17. If 03 = (V; E, N, ^, . . .) is first- order definable in (V; E) 
and has a canonical binary infective polymorphism of type max (mm), then it also 
has a canonical binary injective polymorphism of type max which is balanced or E- 
dominated (N -dominated) . 

Proof. We prove the statement for type max (the situation for min is dual). Let 
p be the polymorphism of type max. Then h(x, y) := p(x, p(x, y)) is not iV-dominated 
in the first argument; this is easy to see. But then p(h(x, y), h(y, x)) is either balanced 
or iJ-dominated, and still of type max. □ 

We apply Theorem |6.2.4| to our setting as follows. 

Proposition 9.4.18. Let 03 be a structure with a first-order definition in (V; E) 
and a finite relational signature, and suppose B is preserved by a binary canonical 
injection which is of type max and balanced or E-dominated, or of type min and 
balanced or N -dominated. Then CSP(03) can be solved in polynomial time. 

Proof. First note that 

GSP(Y; E,^E,N,^N,=,^) 

can be solved in polynomial time. One way to see this is to verify that all relations 
are preserved by a straight polymorphism of type majority, and to use the algorithm 
presented in Section [9.4. 3| We observe the following. 

• A canonical binary injection which is of type min and ./V-dominated is an 
embedding of (V; E, =) 2 into (V; E, =). 

• A canonical binary injection which is of type max and -E-dominated is a an 
embedding of (V; N, =) 2 into (V; N, =). 

• A canonical binary injection which is of type max and balanced is an em- 
bedding of (V; ->E, =) 2 into (V; ->E, =). 

• A canonical binary injection which is of type min and balanced is an em- 
bedding of (V; ->N, =) 2 into (V; ->N, =). 



In each case, polynomial-time tractability of CSP(03) follows from Theorem 6.2.4 □ 
This completes the proof of Proposition |9.3.4| 

9.5. Classification 

Thcorem |9.1.3| shows that the CSP for every template with a first-order definition 
in (V; E) is NP-hard, or polynomial-time tractable. In this section we present a refined 
description of the polymorphisms that imply tractability. This leads to a dichotomy 
result for structures 03 with a first-order definition in (V; E) that holds without any 
complexity-theoretic assumptions, which has already been stated in Theorem |9.1.4| 
either 

(1) there is a primitive positive interpretation of ({0, 1}; 1IN3) in 03 (in this case 



03 has a finite-signature reduct with an NP-hard CSP, by Corollary 5.5.8), 
or 

(2) 03 has a 4-ary polymorphism and automorphisms ai, ct2 such that 

Vx, y e V. f(y, y, x, x) = aif(x, x, x, y) = a 2 f(y, x, y, x) (11) 

(in this case every finite signature reduct of 03 has a polynomial-time tractable 
CSP). 



It follows from Proposition 5.6.10 that (1) and (2) are indeed disjoint cases. 

In order to prove that every 03 satisfies (1) or (2) above, we first determine a list 
of 17 operations with the following properties: 
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(a) every structure 03 with a first-order definition in (V; E) either interprets 
({0, 1}; 1IN3) or is preserved by one of those 17 operations; and 

(b) the list is minimal, that is, if any operation is removed from the list, then 
the list looses property (a). 

For the purpose of this chapter, let us call a clone ^ tame if it is the polymorphism 
clone of a structure that does not primitively positively interpret ({0, 1}; 1IN3) (for 
equivalent characterizations of tame clones, see Theorem 5.5.18[ ). Then using this 



terminology, the list above gives us 17 tame clones that contain the automorphisms 
of (V; E) and that are inclusion minimal with those properties. Our next step will be 



the verification that each of the 17 operations generates an operation satisfying ( 11 ). 
Such operations have been called Siggers terms modulo automorphisms of 03 in The- 
orem 5.6.8 It will turn out that CSP(03) can be solved in polynomial time if 03 has 



one of those operations as a polymorphism. 

Definition 9.5.1. Let B be a behavior for binary functions on (W;E). A ternary 
injection / : V 3 — > V is hyperplanely of type B if the binary functions (x, y) i— > 
f(x, y, c), (x, z) i — y f(x, c, z), and (y, z) t— > /(c, y, z) have behavior B for all c G V. 

We have already met a special case of this concept in Definition |9.4.9| of Sec- 
tion 9.4.2 a ternary function is straight balanced p\ if and only if it is hyperplanely 
balanced and of type p\ . The following behaviors of binary functions appear hyper- 
planely in ternary functions of our classification result. 

Definition 9.5.2. A binary injection f: V 2 — > V is of type 

• i?-dominatcd if the image of f is a clique; 

• iV-dominated if the image of f is an independent set; 

• xnor if for all u,v G V 2 with ^=^(u,v) the relation E(f(u),f(v)) holds if 
and only if EE{u,v) or NN(u,v) holds; 

• xor if for all u,v G V 2 with j^j^(u,v) the relation E(f(u),f(v)) holds if and 
only if neither EE{u, v) nor NN(u, v) hold. 

The following theorem lists the tame minimal clones, and immediately implies 



Theorem 9.5.3. Every minimal tame clone is generated by the automorphisms 
of (V; E) and one of the following operations. 

(1) A constant operation, 

(2) a function of type majority which is hyperplanely balanced and of type pro- 
jection, 

(3) a function of type minority which is hyperplanely balanced and of type pro- 
jection, 

(4) a balanced binary injection of type max, 

(5) an E-dominated binary injection of type max, 

(6) a function of type majority which is hyperplanely E-constant, 

(7) a function of type majority which is hyperplanely of type max and E-dominated, 

(8) a function of type minority which is hyperplanely of type projection and E- 
dominated, 

(9) a function of type minority which is hyperplanely of type xnor and E-dominated, 
(10) a binary injection which is E-constant, 

or the dual of one of the last seven operations. Those 17 clones are pairwise incom- 
parable with respect to set inclusion, and each one contains a four-ary operation f 
satisfying Siggers term identities modulo its invertible operations. 
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PROOF. We first discuss why each of the 17 clones contains a 4-ary operation / 
satisfying Siggers term equations modulo its invertibles. By Proposition |5. 6.101 this 
implies in particular that each of the 17 clones is tame. 

Observe that all of the 17 operations listed above are canonical as functions over 



(Y;E). Let g be one of them. By Lemma 5.6.7 there is a homomorphism /i from 
(V; g) 2 to an algebra A = ({=, E, N}, /) (where =, E, and N are the image of [i for 
the pairs (x,y) G V 2 such that x = y, E(x,y), or N(x,y), respectively). 

In three of the 17 cases the algebra A is not idempotent. This is the case 
when g is constant; but then /: (x,y,u,v) i— > g(g{g(x,y),u),v) satisfies in partic- 



ular Vx, y.f(y, y, x, x) — f(x, x, x, y) = f(y, x, y, x). The other two cases are case ( 10 1 
and its dual. In the first case, the image of g induces an infinite clique in (V; E); the 
following argument applies in dual form to the second case as well. As in the proof of 
Corollary |3. 6. 2| we see that 23 is preserved by all permutations of its domain. Also note 
that A has a congruence with the congruence classes {—} and {E, N} (see Proposi- 
tion 6.1.4), and in the corresponding quotient algebra g denotes max with respect to 
the order {=} < {E, N}. It is then easy to see that /: (x, y, u, v) i— > g(g(g(x, y),u), v) 
satisfies Siggers term identities modulo a permutation. 

By Theorem |5.6.4| in combination with Theorem |5.6.2| and Theorem |5.6.3[ every 
finite idempotent algebra either 

• has a 2-element factor all of whose operations are projections, or 

• the operations of the algebra generate an operation / satisfying 

f(y, V, x, x) = f(x, x, x, y) = f(y, x, y, x) . 

Hence, by Lemma |5.6.7[ it suffices to show in the the remaining 14 cases that all 
factors of A contain operations that are not projections. Since in those 14 cases both 
E and N are preserved by g, the relation ^ is also preserved, and {E, N} induce a 
subalgebra of A. In this subalgebra and if the operation g is ternary, it either acts 
as a majority (that is, g{x,x,y) — g(x,y,x) — g{y,x 1 x) = x), or as a minority (that 
is, g(x,x,y) = g(x,y,x) = g(y,x,x) = y). If / is binary, it satisfies g(x,y) = g(y,x) 
in this subalgebra. In all cases, / does not act as a projection. Four out of the 14 
remaining operations are balanced, which is equivalent to saying that both {E, =} 
and {N, =} induce a subalgebra B in A. In this case it is easy to check from the 
description of the balanced operations in Theorem |9.5.3| that 

g(x,y) satisfies g{x,y) = g{y,x) if / is binary, and (12) 
h{x,y) := g(x,x,y) satisfies h(x,y) = h(y,x) if g is ternary. (13) 

So g is not a projection in those factors as well. For five of the remaining non-balanced 
operations we have that {E, —} induces a subalgebra of A. Again, g satisfies the 



condition in (12). For the other five remaining operations, the set {N, =} induces 
a subalgebra, and the argument that the operation / is not a projection in those 
algebras is analogous. 

Finally, we have to argue that the operation g is in none of the 2-element ho- 
momorphic images of A a projection. Since all of the 14 remaining operations are 



injective, they have a congruence with the classes {E, N} and {=} (Proposition 6.1.4) 



Then the operation g satisfies ( 12 ) in the corresponding factor. It can be verified that 
from all 14 operations, only 

• the balanced operation of type max, 

• the N-dominated operation of type min, 

• and the edge majority that is hyperplanely of type min and N-dominated 
preserve the relation E(x,y) <-> E(u,v). In those cases, the algebra A has a congru- 
ence with the classes {E} and {N, =}. For the balanced operation of type max, and 
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the N-dominated operation of type min, the corresponding quotient the operation g 



satisfies the condition in ( 12 ). For the edge majority that is hyperplanely of type min 



and N-dominated, the condition in (13) applies. Congruences of A with the classes 
{N} and {E, =} can be checked analogously. We conclude that all operations in the 
list generate tame clones. 

It is not difficult to see and even automatically verifiable that all of the clones 
are incomparable, that is, none of the clones in the list contains another clone of the 
list. We next prove that any tame clone which contains Aut((V;i?)) contains one of 
the clones of the theorem; together with incomparability, this implies that our list is 
exhaustive, and that the clones in our list are indeed minimal tame clones. 

So suppose that ^ is any tame clone, and let 05 be a relational structure whose 
polymorphism clone is 'rf. If c (§ contains a constant operation, then it contains the 
operation from case 0, so we may assume that this is not the case. Assume next that 
^ contains e#. Then consider the structure 23 induced in 05 on the image D :— e_g[V]. 
Since 05 and 23 are homomorphically equivalent, 23 is also tame. Since 23 is definable 



in the structure (D; =) it follows from Theorem 6.3.3 that it has a polymorphism 
which is either a constant or a binary injection. The former case is impossible as 
otherwise also 05 has a constant polymorphism by composing the constant of 23 with 
&e- Let thus g(x,y) be the binary injection on D which is a polymorphism of 23. 
Then h(x, y) := g(eE(x), es(y)) is a polymorphism of 05. But h is a binary canonical 



injection which is £ , -constant, and hence c i contains the operation from case (10) of 
our list. The argument when ^ contains e^r is identical. 

Hence, by Theorem |9 . 2 . 4| it remains to consider the case where the endomorphisms 
of 05 are generated by the automorphisms of 05, that is, 05 is a model-complete core 



(Theorem 3.6.11 ). Let :— Aut(05). By Theorem 9.2.6 there are five possibilities for 



Suppose first that Sf is the group of all permutations on V. Since "if is tame and 



contains no constant operation, it contains all binary injections, by the results of 37 



In particular, contains, say, a balanced binary canonical injection operation of type 
max. 

We can therefore assume that Sf does not contain all permutations. Next con- 
sider the case where <$ is the group generated by the function — : V — > V together 
with Aut((V; E)). First note that the relation P 1 - 3 ' does not have a primitive positive 
definition in 05, since in this case ({0, 1}; NAE) would have a primitive positive in- 
terpretation in 05 by Proposition |9.2.9| and by Theorem |5.5.18| €. would not be tame 
contrary to our assumptions. Since — is in Sf, the relation P^ consists of only one 
orbit of 05, and hence is violated by an endomorphism of 05. This is a contradiction 
since the function — generates all endomorphisms, and preserves P^ 3 \ Now suppose 
that ^ contains sw and is a proper subgroup of the full symmetric group. Then 
the relation T has just one orbit in 05, and since T must be violated, it is also vio- 
lated by an endomorphism, and hence also an automorphism of 05. This is again a 
contradiction since the functions in Sf preserve T. 

It remains to consider the case = Aut((V;£')); this implies that all poly- 
morphisms of 05 preserve E and N, so 05 contains the relations E and N. Thus 
Proposition |9.3.4| applies, and contains a binary canonical injection of type max 
or min, or a function of type minority or majority. If it contains a canonical in- 
jection of type max or min, then it contains one of the clones from case Q to ^ 



by Proposition 9.4.17 Otherwise, it contains a ternary injection t of type minority 



or majority, and one of the binary canonical injections of type projection listed in 



Theorem 9.3.8 which we denote by p. Set s(x,y,z) := t(p(x,y),p(y, z),p(z,x)) and 
w(x,y,z) := s(p(x, y), p(y , z),p(z, x)). Then w is one of the clones from case (§ to 
case ([9]) — see Figure [9T4| we leave the verification to the reader. □ 
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Binary injection type pi 


Type majority 


Type minority 


Balanced 

_E-dominated 

iV-dominated 

Balanced in 1st, _E-dom. in 2nd arg. 
Balanced in 1st, JV-dom. in 2nd arg. 


Hp. balanced, type pi 
Hp. inconstant 
Hp. iV-constant 
Hp. type max, _E-dom. 
Hp. type min, iV-dom. 


Hp. balanced, type p\ 
Hp. type pi, i5-dominated 
Hp. type pi, iV-dominated 
Hp. type xnor, iJ-dom. 
Hp. type xor, iV-dom. 



FIGURE 9.4. Minimal tractable canonical functions of type majority 
and minority, and their corresponding canonical binary injections of 
type projection. 




FIGURE 9.5. The border: Minimal tractable and maximal hard 
clones containing Aut((V; E)). 



Figure 9.5 shows the border between the hard and the tractable clones. The 
picture contains all minimal tractable clones as well as all maximal hard clones, plus 
some other clones that are of interest in this context. Lines represent containment of 
clones, but edges that are implied by transitivity of containment are not drawn. Note 
that lines do not mean to imply that there are no other clones between them which 
are not shown in the picture. Clones are symbolized with a double border when they 



have a dual clone (generated by the dual function in the sense of Definition 9.3.7 



whose behavior is obtained by exchanging E with N, max with min, and xnor with 
xor). Of two dual clones, only one representative (the one which has E and max in 
its definition) is included in the picture. The numbers of the minimal tractable clones 
refer to the numbers in Theorem 19.5.31 "£ , -semidom" refers to "balanced in the first 
and _B-dominated in the second argument" . 
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PROOF of Theorem 19.1.41 By Theorem |9.5.3[ either there is a primitive pos- 
itive interpretation of ({0,1}; 1 INS) in 03, and the statement follows from Corol- 
lary [5375J or 03 is preserved by one out of the 17 canonical minimal tame operations 
listed in Theorem 19.5.31 

So we are left with the verification that every finite signature reduct 03' of 03 that 
is preserved by one of those 17 operations has a polynomial-time tractable CSP. For 



the constant operation this is Proposition 1.1.11 Cases (4) and (5) are tractable by 



case (f) of Proposition 9.3.4 In all cases, the duals can be solved analogously. The 
functions of type majority or minority are tractable by cases (b) to (e) of Proposi- 
tion 9.3.4 in those cases, certain binary canonical injections of type projection are 



required - these are obtained by identifying any two variables of the function of type 



majority / minority; Figure 9.4 shows which function of type majority / minority 
yields which type of binary injection. We leave the verification to the reader. Finally, 
let f(x,y) be an inconstant binary injection (case (10)), and denote the reduct cor- 
responding to this clone by 03. Then g(x) := f(x,x) is a homomorphism from 58 to 
the structure £ induced by the image <?[V] in 03. This structure £ is invariant under 
all permutations of its domain, and hence is definable in (<7[V];=); such structures 
have been treated in Chapter [6] The structure £ has a binary injection among its 
polymorphisms (namely, the restriction of / to £). It then follows from the results 



37 that CSP(£) is tractable. Hence, CSP (03) tractable as well, since 03 and £ are 



homomorphically equivalent. □ 

By inspection of the primitive positive interpretations used in the classification 
proof, we also obtain the following (the relation E§ has been introduced in Defini- 
tion 



5.5.9) 



COROLLARY 9.5.4. 03 primitively positively interprets 1IN3 if and only if one Ej , 
T, H , or P' 3 ) has a primitive positive definition in 03. 

Corollary 9.5.5. There is an algorithm that, given quantifier-free first-order 
formulas 4>i, . . . ,<f> n defining over (W;E) the relations Ri,...,R n , decides whether 
CSP((V; R\, . . . , R n )) is tractable or NP-complete. 

Proof. Follows from Theorem |8.4.4| in combination with Corollary |9.5.4| □ 
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This chapter contains results from 38 39 47 . 



10.1. Introduction 

A temporal relation is a relation R C Q k , for some finite k, with a first-order 
definition in (Q;<), the ordered rational numbers (which can be thought of time 
points). A temporal constraint language is a set of temporal relations, and will be 
treated here as a relational structure with a first-order definition in (Q; <). Constraint 
satisfaction problems for temporal constraint languages will be called temporal CSPs 
in the following. We have already discussed some temporal constraint languages in 



Section 1.5 for instance and/or precedence constraints from scheduling, and Ord- 
Horn constraints on time points. 

There are also several famous NP-complete temporal CSPs. For example the 



Bctwcenncss Problem 95 , which has been introduced in Example 1.1.3 as a CSP 



with domain Z, can also be formulated as CSP((Q; Betw)) where Betw is the ternary 
relation Betw = {(x,y,z) <E Q 3 | (x < y < z)\/(z < y < x)}. We have seen in Proposi- 
tion 5.5.14 that this CSP is NP-hard. Similarly, the Cyclic Ordering Problem [95] can 
be formulated as the CSP for (Q; {(x, y, z) \ (x < y < z)V(y < z < x)V(z < x < y)}), 



and is also NP-complete 93 (a hardness proof using primitive positive interpretations 
can be found in Section 10.2.3[ ) . 
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A subclass of temporal CSPs called ordering CSPs has been introduced in 103 
An ordering CSP is a temporal CSP where the constraint language only contains 
relations where the arguments are pairwise distinct (thus, CSP((Q; <,^)) is not an 
ordering CSP). Satisfiability thresholds for random instances of ordering CSPs have 
been studied in |97| . Approximability of ordering CSPs has been studied in |115 



The class of temporal constraint languages is of fundamental importance for in- 
finite domain constraint satisfaction, since CSPs for such languages appear as im- 
portant special cases in several other classes of CSPs that have been studied, e.g., 
constraint languages about branching time, partially ordered time, spatial reasoning, 



and set constraints 54 81 121 . Moreover, several polynomial-time solvable classes 



of constraint languages on time intervals 80 133 158 can be solved by translation 



into polynomial-time solvable temporal constraint languages. 

In this chapter we prove a complete classification of the computational complexity 
of CSP (05) when 05 is a temporal constraint language. 

Theorem 10.1.1. Let 05 be a structure with a first-order definition in (Q;<). 
Then exactly one of the following two cases is true. 

• 05 has a polymorphism f and automorphisms ao,ai,a 2 satisfying 

Vxi, x 2 G Q. f(xi, x 2 ) = a f(aix 2 , a 2 xi) . 

In this case, CSP(OS') is in P for every finite reduct 05' of 05. 

• 05 is a model- complete core, and the structure ({0, 1}; HNS) has a primitive 
positive interpretation in the expansion of 05 by finitely many constants. In 
this case, CSP (05) is NP-hard by Corollary \5~5^ 



Our classification proof is based on the universal-algebraic approach and Ramsey 
theory as described in Chapter [5] and Chapter [8j 

10.2. Preliminaries 

10.2.1. Cameron's theorem. In this subsection we recall the classical result 
of Cameron [64] that describes temporal constraint languages up to first-order inter- 
definability. For x\, . . . ,x n G Q write X\ . . . x„ when X\ < . . . < x n . 



Theorem 10.2.1 (Relational version of Camerons theorem; see e.g. 122| ). Let 



05 be a temporal constraint language. Then 05 is first-order interdefinable with exactly 
one out of the following five homogeneous structures. 

• The dense linear order (Q; <) itself, 

• The structure (Q; Betw), where Betw is the ternary relation 

{(x,y,z) G Q 3 | xyt V lyi} , 

• The structure (Q; Cycl), where Cycl is the ternary relation 

{(x,y,z) | xyt V yzx V zxt)} , 

• The structure (Q; Sep), where Sep is the 4~ary relation 

{(xi, yi, £2, 2/2) I xix 2 yiyl V xiy 2 yiX2 V yix 2 xiy 2 V y^xixl 

V x 2 x 1 y 2 y{ V x 2 yiy 2 x\ V 2/2^1^22/1 V y 2 yix 2 xl} , 

• The structure (Q;=). 

The relation Sep is the so-called separation relation; note that Sep{x\, y\, x%, y 2 ) 
holds for elements X\,y\,x 2 ,y 2 G Q iff all four points x±,yi,x 2 ,y 2 are distinct and 
the smallest interval over Q containing Xi,yi properly overlaps with the smallest 
interval containing x 2 , y 2 (where properly overlaps means that the two intervals have 
a non-empty intersection, but none of the intervals contains the other) . 
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The next theorem is also due to Cameron 64 , and was his original motivation for 
the investigation of structures with a first-order definition in (Q; <). It is not used for 
our results; however, we would like to state it here because it provides a fundamentally 
different characterization of the class of temporal constraint languages. 

Theorem 10.2.2. A relational structure 03 is highly set-transitive if and only if 
it is a temporal constraint language. 

10.2.2. Polymorphisms of Temporal Constraint Languages. For this chap- 
ter only, we make the following convention. We say that a set of operations T gener- 
ates an operation g if T together with all automorphisms of (Q; <) locally generates 
g. In case that T contains just one operation /, we also say that / generates g. 

A fc-ary operation / on Q defines a weak linear order -< on Q fc , as follows: for x, y £ 
Q k , let x -< y iff f(x) < f(y). The following observation follows straightforwardly 
from Proposition |5.2.T] 

Observation 10.2.3. Let f and g be two k-ary operations that define the same 
weak linear order on Q k . Then f generates g and g generates f. 

We now define fundamental operations on Q. The unary operation -h- is defined 
as 4->(x) := —x in the usual sense. Let c be any irrational number, and let e be 
any order-preserving bijection between (— oo,c) and (c, oo). Then the operation O is 
defined by e(x) for x < c and by e~ 1 (x) for x > c. With these operations and the 
notion of generation, Cameron's theorem can be rephrased as follows. 



Theorem 10.2.4 (Operational version of Camerons theorem; see e.g. 122| ). Let 



5B be a temporal constraint language. Then exactly one of the following holds. 

• Aut(Q3) equals Aut((Q; <)); 

• The automorphisms o/*8 are the permutations generated by 

• The automorphisms o/*B are the permutations generated by O; 

• The automorphisms o/*B are the permutations generated by -H- and O; 

• Aut(Q5) equals Sym(Q). 

If / is a fc-ary operation on Q, then the operation f>/(mi, . . . , ■H-acj,) is called the 
dual of /. Note that if / preserves an m-ary relation R, then the dual of / preserves the 
relation f>i?, which is defined to be the relation {(<s->ai, . . . , Ha ro ) | (oi, . . . , a m ) £ R}. 
Clearly, CSP((Q; Ri, . . . , R k )) and CSP((Q; -B-iJi, . . . ,<-KR*)) are exactly the same 
computational problem. 

10.2.3. Hard temporal CSPs. In this subsection we discuss various important 
NP-complete temporal constraint satisfaction problems. We have already mentioned 
in the introduction that the Betweenness and the Cyclic Ordering Problem in |95| 
can be formulated as temporal CSPs, and that these problems are NP-complete. 
The corresponding relations Betw and Cycl re-appeared in Cameron's theorem (The- 
orem 10.2.11. Another relation that appeared in Theorem 10.2.1 is the separation 



relation Sep. The corresponding CSP is again NP-complete. 

Proposition 10.2.5. There is a primitive positive interpretation of (Q; Betw) in 
(Q; Sep, 0), and a primitive positive interpretation of ({0, 1}; 1IN3) in (Q; Sep, 0, 1). 
The problem CSP((Q; Sep)) is NP-hard. 

Proof. The interpretation of (Q; Betw) in (Q;Sep, 0) is 1-dimensional, and the 
domain formula 8(x) is 3u,v. Sep(x,u,v,0), and defines Q\ {0} in (Q; Sep, 0). The 
coordinate map is the identity. Then the formula Sep{x, y, z, 0) interprets Betw. 
Since ({0, 1}; 1IN3) has a primitive positive interpretation in (Q; Betw, 1) by Propo- 
sition 5.5. 14| (we can choose 1 in place of since (Q; Betw) is transitive), we obtain a 
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primitive positive interpretation of ({0, 1}; UN 3) in (Q; Sep, 0, 1) by composing inter- 
pretations (see Section 5.5.4 1. Since the orbit of the pair (0, 1) is primitive positive de- 
finable in (Q; Sep) we can apply Proposition 5.5.12 and NP-hardness of CSP((Q; Sep)) 
follows from NP-hardness of ({0, 1}; 1IN3). □ 

An important relation for our classification is the relation T3, defined as follows. 

Definition 10.2.6. Let T 3 be the ternary relation 

{(at, y, z) e Q 3 I (as = y < z) V (x = z < y)} 

Proposition 10.2.7. The structure ({0, 1}; 1IN3) has a primitive positive inter- 
pretation in (Q;T 3 ,0). The problem CSP((Q;T 3 )) is NP-hard. 

Proof. We give a primitive positive interpretation / of the structure ({0, 1}; 1IN3) 
in (Q;T 3 ,0). The interpretation 

• has dimension 2; 

• the domain formula 6~i(xi,X2) is T 3 (0, x\, £2); 

• the formula 1IN3i(xi, X2, yi, 2/2, z\, z 2 ) is 

3m (T 3 (u, xi,yi) A T 3 (0, u, Z\)) ; 

• the formula =j (x 1 ,x 2 ,y 1 ,y 2 ) is T 3 (0,x 1 ,y 2 ); 

• the coordinate map h: <5/(03 2 ) — > {0, 1} is defined as follows. Let (61,62) be 
a pair of elements of 03 that satisfies Sj. Then exactly one of 61,62 must 
have value 0, and the other element is strictly greater than 0. We define 
h{b\,b 2 ) to be 1 if 61 = 0, and to be otherwise. 

To see that this is the intended interpretation, let (xi,x 2 ), (yi,y 2 ), (21, z 2 ) £ <5/(03 2 ), 
and suppose that t := (h(xi, x 2 ), h(yi, y 2 ), h(zi, z 2 j) — (1,0,0) € 1IN3. We have to 
verify that (x\, x 2 , yi, y 2 , z\, z 2 ) satisfies llNSi in 03. Since h(x\,x 2 ) = 1, we have 
x\ = 0, and similarly we get that 2/1,21 > 0. We can then set a to and have 
T 3 (u, x\, j/i) since = u = x% < y±, and we also have T 3 (0,u, Z\) since = u < Z\, 
The case that t = (0,1,0) is analogous. Suppose now that t = (0,0,1) E 1IN3. 
Then x%,yx > 0, and z% = 0. We can then set u to min(xi,yi), and therefore 
have T 3 (m, xi, yi), and T 3 (0,m, z\) since = Z\ < u. Conversely, suppose that 
(x\,x 2 ,yi,y 2 , z\, z 2 ) satisfies UNSi in 03. Since T 3 (0,u, z\), exactly one out of u, z\ 
equals 0. When u = 0, then because of Tz{u, asi, y\) exactly one out of x±,yi equals 
0, and we get that (h(x 1 ,x 2 ),h(y 1 ,y 2 ),h(z 1 , z 2 )) € {(0, 1, 0), (1, 0, 0)} C 1IN3. When 
u > 0, then x\ > and y\ > 0, and so (h(xi,x 2 ),h(yi,y 2 ),h(zi,z 2 )) = (0,0,1) G 
1IN3. 

Since the orbit of is primitive positive definable, NP-hardness of CSP((Q;T)) 



follows from the NP-hardness of CSP(({0, 1}; 1IN3)) via Proposition 5.5.12 □ 



We will see in Theorem |10.6.3| that if no relation among Betw, Cycl, Sep, T 3 , 
or Eq is primitive positive definable in a temporal constraint language 03, then 
CSP(03) is tractable. In fact, when 03 is (Q; R) for one of the relations R above, then 
we give primitive positive interpretations of ({0, 1}; 1IN3) with finitely many constants 
in 03. Thus, hardness of temporal CSPs can always be shown with Proposition |5 . 5 . 121 
We have already seen this for Betw, Sep, T 3 , f->T 3 , and E G , and close by showing it 
for Cycl. 

Theorem 10.2.8. The structure ({0,1}; 1IN3) has a primitive positive interpre- 
tation in (Q; Cycl). 

Proof. Let R — {0, l} 3 \ {(0, 0, 0)} and TV = {(0, 1), (1, 0)}. Then the relation 
NAE(x, y, z) has in ({0, 1}; R, N) the primitive positive definition 

3x', y', z' (N(x, x') A N(y, y') A N(z, z') A R(x, y, z) A R(x' , y' , z')) . 
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Hence, by Proposition 5.4.4 and Theorem 5.5. 18| it suffices to prove that ({0, 1}; R, N) 



has a primitive positive interpretation in (Q; Cycl). 

The idea of the interpretation is inspired by the NP-hardness proof of [93] for 
the 'Cyclic ordering problem' (see [95| ). The dimension of our interpretation I is 
three, and the domain formula <5j(xi, x 2 , £3) is x\ ^ x 2 A x 2 ^ £3 A £3 ^ x\, which 
clearly has a primitive positive definition in (Q; Cycl). The coordinate map h sends 
(xi,x 2 ,x 3 ) to if Cycl{xi, x 2 , x 3 ), and to 1 otherwise. Note that h(xx, X2, x 3 ) = 1 if 
and only Cycl(xi,x 3 ,x 2 ). 

Let <f>(xi, X2,xs, yi, 2/2, 2/3) be the formula 

Cycl(x 1 ,y 1 ,x 2 ) A Cycl(yx, x 2 , y 2 ) A Cycl(x 2 ,y 2 , x 3 ) 
Cycl(y 2 ,x 3 ,y 3 ) A Cycl(x 3 , y 3 ,xx) A Cycl(y 3 ,xx,yx) . 

When (ai, . . . , ag) satisfies 0, we can imagine 01, . . . , ag as points that appear clock- 
wise in this order on the unit circle. In particular, we then have that Cycl(ax, 03, 0,5) 
holds if and only if Cycl(a 2 , 04, ag) holds. The formula =/ (x\, x 2 , x 3 , yx, y 2 , y 3 ) is 

3u\,...,uj (<t>(x 1 ,x 2 ,x 3 ,u\,ul, ul) 



4+1 ,-2 +1 ,-3 +1 ) 



( = 1 



A (t)(ut,u 2 ,u 3 ,yi,y 2 ,y 3 )) 

which is equivalent to S(x 1 ,x 2 ,x 3 )AS(y 1 ,y 2 ,y 3 )A(Cycl(x 1 ,x 2 ,x 3 ) Cycl(y 1 ,y 2 ,y 3 )). 
To see this, suppose that (01,02,03,61,62,63) € Q 6 satisfies =/, and suppose that 
(01,02,03) € Cycl. Then the conjunct cj)(xi,x 2l x 3 ,u\,u 2 ,ul) of =7 implies that 
Cycl(u{, u\, u\). The conjunct 4>{u\, u l 2 , u 3 , u\ +1 , u 2 +1 , u 3 +1 ) implies that Cycl[u l ^ x , 
u 2 +1 ' u 3 +1 )> f° r au * — 3, and so by the last conjunct we finally conclude that 
Cycl(b\, 62, 63). The same reasoning shows that —j (ax, a 2 , a 3 , 61, b 2 , b 3 )ACycl(bx, b 2 ,b 3 ) 
implies Cycl(ax, a 2 , a 3 ). 

Conversely, if we have Cycl(ax, a 2 , a 3 ) and Cycl(bi,b 2l b 3 ) then we can select points 
for u\,...,u 3 that show that =/ (01,02,03,61,62,63) is satisfiable, as follows. We 
only consider orders on 01,02,03,61,62,63 that start with ax] this is without loss of 
generality, since Cycl is preserved by O. 

The formula ^i(x 1 ,x 2 ,x 3 ,y 1 ,y 2 ,y 3 ) is = 2 (xx,x 2 ,x 3 , z 1: z 3 , z 2 ). 

The formula R I (x 1 ,x 2 ,x 3 ,y 1 ,y 2 ,y 3 , Zi, z 2 , z 3 ) is 

3a, 6, c, d, e, /, g,h,i,j,k,l,m,n( Cycl(a, c,j) A Cyc/(6, j, fc) A Cycl(c, k, I) 

A Cycl{d, /, j) A Cyclic, I) A Cyc/(/, Z, m) 
A Cycl(g, i, fc) A Cycl(h, fc, to) A Cycl{i, to, n) 
A Cycl(n,m,l)A —1 (xx, X2, x 3 , a, b, c) 
A =1 {yx,y2,V3,d,e,f)/\ =/ (z 1 ,z 2 ,z 3 ,g,h,i)) . 

The proof that for all tuples 01,02,03 € Q 3 

(h(ax),h(a 3 ),h(a 3 )) e R (Q; CfycZ) |= #7(01,03,03) 



follows the correctness proof of the reduction presented in [93] . 

First, suppose that 01,02,03 g Q 3 are such that (h(ax) , h(a 2 ) , h(a 3 )) ^ i?. Hence, 
/i(etj) = and C?/d(di[l], Oj[2], di[3]) for all i < 3. Suppose for contradiction that 
there were values for the variables a, 6, c, d, e, /, (?, h, i, j, k, I, to, n in i?j such that all 
the conjuncts of Ri are true. Then the conjuncts of Ri involving the formula =/ 
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Cyclic order on (a, b) 


How to set u\ , . . . , u 3 


a\ < a 2 < a 3 < b\ < b 2 < b 3 


T 9 1 ^ 3 i 

ai < u[ < a 2 < u{ < u 2 < a 3 < < u{ < bi 
< u\ < ul < u\ < b 2 < u\ < ■ ■ ■ < u\ < b 3 


ai < a 2 < bi < a 3 < b 2 < b 3 


T 9 1 3 7 T 

ai < u{ < a 2 < u\ < u\ < u{ < bi < < a 3 
< u\ < u\ < u\ < b 2 < u\ < ■ ■ ■ < itf < b 3 


ai < a 2 < bi < b 2 < a 3 < b 3 


i o a. 7 i *y 

<X\ < u{ < a 2 < u\ < u~i < uj < Oi < «2 < 1*2 
< U2 < u\ < b 2 < a 3 < < • • ■ < u 3 < b 3 


a\ < a 2 < h < b 2 < b 3 < a 3 


3 ^ 9 1 7 2 7 

ai < u\ < u 2 < u 3 < u{ < a 2 < bi < u 2 < b 2 
< u 3 < u 3 < u\ < u\ < 63 < a 3 < u\ < u\ < u\ 


ax < &! < a 2 < a 3 < b 2 < b 3 


1 4 7 1 

ai < u{ < ■ ■ ■ < u\ < bi < a 2 < < a 3 < 
U 2 < U 2 < U 2 < b 2 < U3 < • • ■ < 1*3 < 63 


a\ < b\ < a 2 < b 2 < a 3 < b 3 


a\ < u{ < ■ ■ ■ < u\ < bi < a 2 < < • • • < u\ < 
b 2 < a 3 < U3 < • • • < u 3 < b 3 


ai < bi < a 2 < b 2 < b 3 < a 3 


a\ < u\ < bi < u 2 < u 3 < u\ < a 2 < u\ < b 2 < 
u 3 < u 3 < b 3 < u\ < u\ < a 3 < u\ < u\ < U3 


ai < bi < b 2 < a 2 < a 3 < b 3 


ai < u\ < b\ < u 2 < u 2 < b 2 < u 3 < u\ < a 2 < 
u\ < u\ < a 3 < u 3 < u 3 < b 3 


ai < &! < b 2 < a 2 < b 3 < a 3 


ai < uf < bi < 11% < 11% < b 2 < ul < u{ < a 2 < 
U3 < 1*3 < 63 < u\ < u\ < a 3 < u\ < u\ < u\ 


ai < bi < b 2 < b 3 < a 2 < a 3 


a\ < < u\ < bi < uj < v? 2 < u i < b 2 < b 3 < 
U3 < • • • < M3 < a 2 < u\ < a 3 < u\ < 1*2 



Figure 10.1. Assignments for the correctness of the primitive pos- 
itive definition of <p = . 



(h(a 1 ),h(a 2 ),h(a 3 )) 


Cycl contains 


Order on variables 


(1,0,0) 
(0,1,0) 
(0,0,1) 
(1,1,0) 

(1,0,1) 
(0,1,1) 

(1,1,1) 


(a,c, b),(d,e,f),{g,h,i) 
(a,b,c),(d,e,f),(g,i,h) 
(a,b,c),(d,e,f),{g,i,h) 
(a,c, b),(d, /, e),(g,h,i) 
(a,c, b),(d,e,f),(g,i,h) 
(a,b,c),(d, /, e),(g,i,h) 
(a,c, b),(d,e,f),(g,i,h) 


ackmbdef 'jlnghi 
abcjkdmflneghi 
abode f ' jklngimli 
ackmbdfe jlnghi 
ackmbdef jlngifi 
abcjkdmflnegik 
acbjkdmflnegifi 



FIGURE 10.2. Assignments for the correctness of the interpretation 
of ({0, 1}; 1IN3) in (Q;Cycl). We use the terminology from Sec- 
tion [Toll] 

imply that Cycl(a,b,c), Cycl(d,e, /), and Cycl{g,h,i). Then 

Cycl{a,b,c) Cycl[a ^ : '\ Cycl(b,c,j) CycL{bj ' k ' ) > Cycl(c,j,k) Cycl{c ' k ' l) ) Cycl{j,k,l) 

Cycl(d,eJ) Cycl(eJ,j) Cycl(f,j,l) C y cl(j,l,m) 

n H u -\ Cycl(g,i,k) . , . Cycl(h,k,m) , , Cycl(i,m,n) 

Cycl(g, h,i) 5- Lyclyh^i, k) > Lyclyi, k,m) > LycA(k,rn,ri) 

Cycl{j 7 k, I) Cy — - ' — Cycl(k, I, m) Cy - — Cycl(l, m, n) . 
But Cycl(l,m,n) is in contradiction to the conjunct Cycl(n,m,l). 
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Conversely, suppose that 0,1,0,2,0,3 € Q 3 are such that {h(a\), h{a,2), h{a^)) € R. 
Hence, there must be an i < 3 such that h{a,i) — 1, and thus Cycl(cLi[l], <Xj[2], flj[3]). 
We have to find values for the variables a, b, c,d,e,f,g,h,i,j,k,l,m,n of Ri so that 
all conjuncts of Ri are satisfied when evaluated at 0,1,0,2,0,3. For this, it suffices to 
specify a linear order on those variables that satisfies all the conjuncts in Rj so that 
C?/cZ(ai[l],ai[2],ai[3]) if and only if Cycl(a,b,c), Cycl(a,2[l], a2[2], «2[3]) if and only 
if Cycl(d,e, f), and Cycl(a 3 [l], a 3 [2], 03(3]) if and only if Cycl(g,h,i). Those linear 
orders can be found in the table of Figure |10.2| □ 

10.3. Endomorphisms 

In this section we study the endomorphisms of temporal constraint languages. 
As an application, we obtain a reduction of the complexity classification for temporal 
constraint satisfaction problems to the classification for those languages that admit a 
primitive positive definition of the binary relation <. 

Theorem 10.3.1. Let % be a temporal constraint language. Then exactly one of 
the following cases applies. 

(1) 23 has a constant endomorphism; 

(2) All endomorphisms 0/ 23 preserve <; 

(3) End(23) equals the set of unary operations generated by o; 

(4) End(23) equals the set of unary operations generated by O; 

(5) End(23) equals the set of unary operations generated by O and O; 

(6) End(23) equals the set of all infective unary operations. 

Proof. First note that all the cases are indeed disjoint: a constant endomor- 
phism violates <, and cannot be generated by a set of injective unary operations; 
this shows that the first case is distinct from all others. Disjointness of the remaining 



cases follows from Theorem llO.2.41 

If 23 has a non-injective endomorphism, then Corollary |5.3.7| shows that there 
is also a constant endomorphism. Otherwise all endomorphisms of 23 are injec- 
tive. We show that then all endomorphisms e of 58 are locally invertible: for any 
<2i, . . . , oj € Q there exists a self-embedding / of 23 into 23 such that /(e(oj)) = a, 
for all i € {1, ... , I}. Because e is injective, there is an a € Aut((Q; <)) such that 
ae({ai, . . . , a;}) = {a%, . . . , a;}. Then (ae) i! , i.e., the composition of (ae) . . . (ae) 
with /-factorial many terms of the form (ae), maps to itself for all 1 < i < I. Then 
(ae) i _1 a is also an endomorphism of 23, and we have ((ae)'' -1 a) (e(ai), . . . , e(a;)) = 
(ae) i! (ai, . . . , ai) — (ai, . . . , ai). This proves that e is locally invertible. 



Theorem 3.6.7 shows that the endomorphisms of 23 are generated by the automor- 



phisms of 23. The claim of the statement follows directly from Theorem 10.2.4 □ 



The following theorem shows that we can focus on constraint languages where < 
is primitive positive definable. 

Theorem 10.3.2. Let 23 be a temporal constraint language. Then it satisfies at 
least one of the following: 

(a) There is a primitive positive definition of Cycl, Betw, or Sep in 23. 

(b) Pol(23) contains a constant operation. 

(c) Aut(*&) contains all permutations o/Q. 

(d) There is a primitive positive definition of < in 23 . 

Proof. If there is a pp definition of Betw in 23 we are in case (a). Otherwise, 
since Betw consists of two orbits of triples of the automorphism group of (Q;<), 



Lemma 5.3.5 shows that 23 has a binary polymorphism that violates Betw. If there 
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is a pp definition of < in 25, we are in case (d). Otherwise, again by Lemma 5.3.5 



there is a unary polymorphism of 25 that violates <. Proposition |1 0.3.1] shows that 
25 is preserved by a constant, — , or O- For each of these three operations we show 
the claim of the statement separately in the following three paragraphs. 

If 25 is preserved by a constant we are in case (b) , so we assume in the following 
that 25 is not preserved by a constant. 

If 25 is preserved by — , the relation Betw consists of only one orbit of triples, 
and Lemma |5.3.5| shows that there is an endomorphism that violates Betw. Proposi- 
tion 10.3.1 then implies that 25 is also preserved by O. Thus, the relation Sep consists 
of only one orbit of 4-tuples. Again, either Sep has a pp definition, and we are in case 
(a), or there is an endomorphism that violates Sep. Proposition 10.3.1 now shows 
that 25 is preserved by all injective unary operations and we are in case (c). 

If 25 is preserved by O, then the relation Cycl consists of only one orbit of triples. 



If Cycl has a pp definition in 25, we are in case (a). Otherwise, Lemma 5.3.5 shows 



that there is an endomorphism that violates Cycl. Proposition 1 1 . 3 . l] t hen shows that 
25 is also preserved by — . But the statement of the lemma has already been shown in 
the case that 25 is preserved by both — and O in the previous paragraph, so we are 
done. □ 

In case (a), there is a finite signature reduct 25' of 25 such that CSP(25') is NP- 



hard, as we have seen in Section 10.2.3 In case (b), for all finite signature reducts 25' 



of 25 the problem CSP(25) is trivially in P (sec Proposition 1.1.11). In case (c) the 
complexity of CSP(25) has been classified in Chapter [6] In the following, we therefore 
study only those temporal constraint languages where < is pp definable. 

10.4. Lex-closed Constraints 

First-order expansions of (Q; <) can be divided into four (non-disjoint) groups: 
those where the betweenness relation is primitive positive definable, those that are 
preserved by an operation called pp, an operation called dual-pp, or by the binary 
injective operation called lextheX we have already encountered in Section |8.3.3| None 
of the three polymorphisms pp, dual-pp, and lex alone guarantees tractability of the 
CSP. An illustration of the complexity classification result for first-order expansions 



of (Q; <) can be found in Figure 10.3 



10.4.1. The operations lex and 11. An important class of temporal constraint 



languages are the languages preserved by the operation lex, introduced in Section|8.3.3 



Recall that lex is a binary injective operation on Q such that lex(a, b) < lex(a' , b') if 



either a < a', or a — a' and b < b'. By Observation 10.2.3[ all such operations 
generate the same clone. We also write 

• lex VtX for the operation (x,y) i-> lex(y,x), 

• lex y - x for the operation (x,y) i-> lex(y, — x), 

• lex x> - y for the operation (x,y) h-> lex(x, —y), 

• lex XtV for the operation (x,y) i-> lex{x,y), 

• p x for the operation {x, y) i-> x, and 

• p y for the operation (x,y) >->• y. 

A k-aiy operation / : Q k — > Q is dominated by the i-th argument when for all 
a, b € Q k it holds that f(a\, . . . , au) < f{b\, ...,&&) if and only if ai<bi. Examples of 
operations dominated by the first argument are p x , lex x y , and lex x _ y , and examples 
of operations dominated by the second argument are p y , lex V:X , lex y _ x . 

It is easy to see that the relation Betw is preserved by lex, and more generally 
by all operations that are dominated by one argument. Therefore, we are interested 
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in P 




Figure 10.3. An illustration of the classification result for temporal 
constraint languages that contain <. Double-circles mean that the 
corresponding operation has a dual generating a distinct clone which 
is not drawn in the figure. 






Figure 10.4. Illustrations of the six basic operations lex XyV , 

l&Ey,X) teXy^—xi Px, Py- 



lex 



•x,-yi 



in further restrictions of languages preserved by lex that imply tractability of the 
corresponding CSP. 

A large tractable temporal constraint language has been introduced in 39 . The 
language is defined in terms of a binary polymorphism, denoted by 11, and it has a 
dual version, which is tractable as well. We will see in Proposition |10.4.7| that this 



language contains the class of Ord-Horn constraints (Section 1.5.9). 

Definition 10.4.1. Let 11: Q 2 -> Q be such that ll(a,b) < ll(a' ,V) if 

• a < and a < a! , or 

• a < and a = a' and b < b' , or 

• a, a' > and b < b' , or 

• a > and b = b' and a < a' . 

All operations satisfying these conditions are by definition injective, and they all 
generate the same clone. For an illustration of 11 and its dual, see Figure |10.5[ It is 
easy to see that 11 generates lex. 

10.4.2. Operations generating 11, dual-ll, or lex. In this section we present 
operations that generate 11, dual-ll, or lex. We again use the concept of a behavior 
of operations over a relational structure; note that a fc-ary operation / behaves like 




a fc-ary operation g on S = Si x • • • x Sk if for all t,t' € S we have f(t) < f(t') 
iff g(t) < <?(£')■ That is, the weak linear order induced by / on the tuples from G 



(in the sense of Observation 10.2.31 is the same as the weak linear order induced on 
these tuples by g. Let Q + denote the set of all positive rational numbers, and let Qq 
denote Q\Q+. 

Definition 10.4.2. Let f,g be from Q 2 — > Q. Then [f\g] denotes an arbitrary 
operation from Q 2 — > Q with the following properties. For all x,x',y,y' € Q, 



i/x < and x' > tfien [f\g]{x,y) < [f\g](x', y'); 
[f\g] behaves like f on Qq x Q; 
[f\g] behaves like g on Q + x Q; 



For example, if / = lex x>y and g — leXy tX , then [f\g] behaves like 11. 

Lemma 10.4.3. Let f,g € {lex XtV ,lex Xt -y,leXy tX ,leXy t - x ,p x ,py}, and let f (g ) 
be lex x y if f (g) is dominated by the first argument, and leXy >x otherwise. Then 
{lex, [f\g]} generates [f'\g'](x,y). 

Proof. By Proposition 1 5.2.1] it suffices to show that every relation R preserved 
by lex and [f\g] is preserved by [/'|(?']. So let R be an arbitrary relation preserved 
by lex and [f\g], let k denote its arity, and let ti,t2 be fc-tuples from R. We have to 
show that i 3 := [f'\g'](ti,t 2 ) is in R. 

Let a € Aut((Q; <)) be such that for each entry x of t\ and for each entry y of 
t 2 , the value of ulex(x,y) is negative when x < 0, and positive otherwise. We will 
show that there is an automorphism of (Q; <) that maps the tuple 

s := [f\g}{otlex{tx,h),lex{t2,ti)) 

to £3, which proves that £3 is in R. It suffices to show for ji, j'2 G [&] that 

< s\j 2 ] if and only if t s \j x ] < t 3 [j 2 ] . (14) 

We can assume that ti[j'i] < ti[j2] by exchanging the name of j\ and j'2 if neces- 
sary, and distinguish three cases: 

• ti[7i] < 0, ti [i 2 ] > 0. Then f 3 [ji] < ta[? 2 ] by definition of [/%']. Since for 
j G [fc], the value of alex(ti[j], ^[j]) is positive if and only if the value of 
ti[j] is positive, we have s[ji] < s[j2] by definition of Thus we have 



verified ( 14 ) in this case. 
• *i[?2] < 0. Note that f(lex(x,y), lex(y,x)) behaves like f'(x,y). Thus, writ- 
ing a[j] for Ze:r(ii[j],t2b]) and b[j] for Zea;(t2[j]j ti [j] ) , we have the following 
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FIGURE 10.6. An illustration of the operation [pjjpj,] (on the left) 
and the operation [lex y ^ x \lex ViX ] (on the right). 



equivalences. 

hVi] < h\j 2 ] iff f(t 1 [ J1 }Mji})<f'(h[j2}Mj2}) 

iff Harbin}) < f(a\j 2 ],b\j 2 ]) 
iff f(aa\3i],b\ji]) < f(aa\j 2 },b\j 2 }) 
iff <«[*»] 

• ti\ji] > 0. This case is analogous to the previous one and left to the reader. 

□ 

Lemma 10.4.4. For f,g G {p y , lex y ^ x } the operation [f\g] generates [lex XtV \g\. 

In particular, for f = g = lex VtX the lemma shows that [f\g] generates 11. For 
f — g — p yi the lemma shows that [f\g] generates [lex x tV \p y ] and in particular lex x>y . 
See Figure [T0.6| for illustrations of those two cases. 

Proof of Lemma fl 0.4. 41 We show that every relation R preserved by [f\g] is 



preserved by [lex x ^ y \g], and conclude by Proposition 5.2.1 that [f\g] generates [lex X:V \g 



So let R be an arbitrary relation preserved by [f\g], let k denote its arity, and let t\, t 2 
be fc-tuples from R. We have to show that t 3 := [lex XyV \g](ti,t 2 ) is in R. 

Let I denote the number of non-positive values in t\. We take a\, . . . , ai from 
Aut((Q; <)) such that an maps all but the i smallest values in t\ to positive values. 
We define a sequence of tuples si, . . . , s\ as follows: s\ =t 2 , and for i > 2 

Si ■= [f\g](aiti, Si-i) . 

Clearly, for all i G [I] the tuple Si is in R. We will show that there is an automor- 
phism of (Q; <) that maps si to £3, which proves that t 3 is also in R. By symmetry 
it is enough to show for j\,j 2 G [k] with ti[?i] < ti\j 2 ] that 

si[ji] < si[j 2 ] if and only if t 3 [ji] < h[j 2 ] . (15) 

We distinguish three cases: 

• h[ji] = ti[j 2 ] < 0. Since a^i[ji] = a^i[j 2 ] for all i€ [I], we have s/[ji] < 
si[]2] if and only if s\[ji] < si[j 2 }. Since si = t 2 and < 0, and 
because / is dominated by the second argument, si[ji] < si[j 2 ] if and only 
if t 3 \ji] < t 3 [j 2 ], which proves Q . 

• < ^1^2], < 0. Let i G [Z] be such that a-iii[ji] < and aiii[j 2 ] > 
0. By definition of [f\g] we see that sjji] < Si[j 2 }. Because ajti[j'i] < 
Q!i*i[j2] for all i G [I], and because [f\g] preserves <, by induction on i' > i 
we have that Si'[ji] < s i'[.h}- In particular, s;[ji] < Sj[ja]. On the other 
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hand, t 3 [ji] < £3^72] by definition of lex xy and [fez^jjg], and so (15) also 
holds in this case. 

> 0- Observe that by the choice of I we have ajiijii] > for all ie [I]. 
Thus (151 holds, because both [f\g] and [fer^lg] behave like g on Q + x Q. 



□ 

10.4.3. Syntactic descriptions. In this section we present a syntactic charac- 
terization of 11-closed relations. The result is based on a result for injective constraint 



languages; recall from Definition 9.4.1 that a relation R is injective if every tuple in 
R only contains tuples with pairwise distinct entries (injective temporal constraint 
languages have been called ordering constraints in the literature on random CSPs, 
and in the literature on approximation ratios for MaxCSPs). 

We first give a syntactic characterization of injective temporal relations that are 
preserved by an operation [f\g] where / is dominated by the first argument and g is 
dominated by the second argument. 

Proposition 10.4.5. Let f,g be binary polymorphisms of (Q; <) such that g is 
dominated by the second argument. Let R be an injective temporal relation that is 
preserved by [f\g\. Then R has a definition by a conjunction of clauses of the form 
zq > Z\ V • • • V zq > z\ and of the form x =/= y. 



Proposition 10.4.5 is a consequence of Lemma 10.4.6 below, which is more flexible 
for later use since it applies to arbitrary temporal relations, not just to injective ones. 
Note that every temporal relation R can be defined by a quantifier-free formula in 
conjunctive normal form where all literals are of the form x > y or x — y; to see 
this, take any quantifier-free formula in conjunctive normal form that defines R over 
(Q; <) and " 

• replace ->(x < y) by the two literals (x > y) V (x — y)\ 

• replace x ^ y by the two literals (x > y) V (y > x); 

• replace x < y by y > x. 

We call formulas in quantifier- free conjunctive normal form where all literals are of 
the form x > y or x — y standard formulas. A clause is bad if it is not of the form 

Zq > Z\ V ■ • • V Zq > Z\ . 

Lemma 10.4.6. Let R be a temporal relation. If R is preserved by the operation 



[f\g] from Proposition 10.4-5, then there is a standard formula without bad clauses 



such that an injective tuple is in R if and only if it satisfies (/>. 

Proof. Suppose that R is n-ary. Let be a standard formula such that an 
injective tuple is in R if and only if it satisfies <f>; choose <f> in such a way that it has 
a minimal number of bad clauses. We can assume without loss of generality that <f> 
does not contain literals of the form x — y; otherwise, we can simply remove those 
literals and work with the resulting formula instead. Suppose for contradiction that <f> 
contains a bad clause C. Then C must contain two literals 1% ;= u > v and I2 '■= r > s 
where u and r are distinct variables. Let V be the set of variables of <f>. We claim 
that <p has an injective satisfying assignment i\ : V — > Q such that l\ is the only literal 
satisfied by i\ in C . Otherwise, the formula obtained from (j> by removing l-y from C 
still has the property that every injective tuple is in R if and only if it satisfies <j>. 
Moreover, the number of bad clauses in the new formula is also minimal, which is 
impossible by the choice of <p. Similarly one can see that (f> has an injective satisfying 
assignment i 2 : V — > Q such that l 2 is the only literal satisfied by i 2 in C. 

Suppose for contradiction that i\ can be chosen such that i\(r) is smaller than 
ii(s), ii(u), and ii(v). Let a 6 Aut((Q; <)) be such that ai(r) = 0. Consider the 
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mapping i : V — > Q defined by io(x) = [f\g}( a H( x ), ^(x))- If a literal in some clause 
of <j) is not satisfied in both tuples t\ and t 2 , then it is also not satisfied in t, because 
[f\g] and a preserve <. Therefore l\ and I2 are the only literals of C satisfied by 
io Since «o( r ) is strictly smaller than io(s), the literal I2 cannot be satisfied by io- 
Since i%{v) > and g is dominated by the second argument, it also holds that 

io(v) > io{u), and hence l\ is not satisfied in io either. So io does not satisfy 0, in 
contradiction to the assumption that R is preserved by [f\g]. 

An analogous argument shows that ii cannot be chosen such that %i (u) is smaller 
than i2(v), 12(1°), and ^(s). We claim that all injective satisfying assignments j : V —> 
Q of also satisfy (u > v) V (u > s). Otherwise, j(u) < j(v) and j(u) < j(s), 
therefore j(u) > j(r) to satisfy the property from the beginning of the paragraph. 
Hence, j(r) < j(v) and j(r) < j(s). But then j(r) is smaller than ^(w), izir), and 
«2(s), in contradiction to what we have shown before. Analogously we can show that 
any injective satisfying assignment of </> also satisfies (r > v) V (r > s). 

Let 0' be the formula obtained from by removing C and adding the two clauses 
(u > v) V (u > s) and (r > u) V (r > s). We show that an injective tuple satisfies ^' 
if and only if it satisfies 4>. By what we have seen above, it suffices to show that cf>' 
implies <fi. Let j be any satisfying assignment of </>'. Clearly, all the clauses of <j) except 
for C are satisfied by j, because they are also present in (/>' . We can reformulate the 
two additional clauses in <f>' to 

(u > v Ar > v) V (u > v Ar > s)V (u > s Ar > v)V (u > s Ar > s) . 

If the first, the second, or the fourth disjunct is satisfied by j, then j(u) > j(v) V 
j( r ) > j( s )> an d therefore C holds in j. If the third disjunct is satisfied by j and 
the literal h does not hold (i.e., j(u) < j(v)), we have the chain of inequalities 
j(s) < j(u) < j(v) < j(r) and hence I2 is satisfied. The formula 4>' has fewer bad 
clauses than cj>, contradicting the choice of 4>. □ 

Before we present the syntactic description for 11-closed temporal relations, we 



first show that in particular all Ord-Horn relations (Section 1.5.9) are preserved by 11 



Proposition 10.4.7. All relations in Ord-Horn are preserved by 11 and dual 11. 

Proof. We give the argument for 11 only; the argument for dual 11 is analogous. 
It suffices to show that every relation that can be defined by a formula <j> of the form 
(x\ = 3/1 A • • ■ A Zfc-i = Vk-i) -> %k O y k is preserved by 11, where O G {=, <, <, 7^}. 
Let t\ and t% be two 2fc-tuples that satisfy <j>. Consider a 2/c-tuple £3 obtained by 
applying 11 componentwise to t\ and t%. Suppose first that there is an i < k — 1 such 
that one of the tuples does not satisfy Xi = Ui- Then x t = j/j is not satisfied in t 3 
as well, by injectivity of 11, and therefore the tuple t 3 satisfies 0. Now consider the 
case that n = yi holds for all i < k — 1 in both tuples t% and ti- Since t\ and t 2 
satisfy 0, the literal XkOyk holds in both t\ and t%. Because 11 preserves all relations 
in {=, <, <, 7^}, the literal XkOyk holds in t%, and therefore £3 satisfies 4> as well. □ 



Since the relation R mm defined by (x > j/)V(x > z) (see Section 1.5.8) is 11-closed 
(this follows directly from Proposition 10.4. 8| below) but not Ord-Horn, the class of 



11-closed constraints is strictly larger than Ord-Horn. A formula is called ll-Horn if it 
is a conjunction of formulas of the following form 

(xi = y% A • • • A x k = yk) -> (zo > zi V • • • V z > z{) , or 

(xi = yi A ■ ■ ■ A x k = yk) -> {zo > zi V ■ • • V z > z t V (z = z\ = ■ ■ ■ = Zi)) 

where < k, I. Note that k or / might be 0: if k = 0, we obtain a formula of the 
form zo > z\ V • • • V Zq > z\ or (zo > Z\ V • • • V Zq > z\ V (zq = Z\ = ■ ■ ■ = z;)), 
and if I = we obtain a disjunction of disequalities. Also note that the variables 
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Xi, . . . , Xk, Di, ■ ■ • i Hk) Z0) ■ ■ ■ > z i need not be pairwise distinct. Also note that the 
clause zi > Z2 V Z3 > Z4 is not 11-Horn. 

Proposition 10.4.8. A temporal relation is ll-closed if and only if it can be 
defined by an ll-Horn formula. 

Proof. The proof that every relation defined by an 11-Horn formula is ll-closed 
is similar to the proof of Proposition |10.4.7| We just need to additionally check 
that the relation defined by z > z\ V • • • V Zq > z\ and the relation defined by 
zq > z\ V • • • V zq > z\ V (zq = ■ ■ ■ = zi) are preserved by 11, which is straightforward. 

The proof of the reverse implication is by induction on the arity n of the temporal 
ll-closed relation R. For n = 2 the statement of the proposition holds, because all 
binary temporal relations can be defined by 11-Horn formulas. For n > 2, we construct 
a formula ip that defines R as follows. 

Let be a standard formula with minimal number of bad clauses such that a) an 
injective tuple is in R if and only if it satisfies 4>, and b) any formula obtained from 
<j> by removing a literal from a clause does not satisfy condition a). Clearly, such a 
formula exists: we can start from any standard formula that satisfies a) and has a 
minimal number of bad clauses, and then remove repeatedly literals from clauses if the 
resulting formula still satisfies a); since deleting literals does not create bad clauses, 



we eventually find a formula that satisfies both conditions a) and b). Lemma 10.4.6 
shows that cf> does not contain bad clauses. 

For all pairs of entries i,j € {1, . . . , n}, i < j, let Rij be the relation defined by 
Rij(xx, . . . ,Xi_i,x i+1) ...,i„)» R(xi, . . .,Xi-x,Xj,x i+ i, . . . ,x n ). Because also R iyj 
is ll-closed, it has an 11-Horn definition <f>^ by inductive assumption. We add to each 
conjunct of faj a literal Xi = Xj to the premise of the implication, such that (p^j 
remains an 11-Horn formula. Let ip be the formula that is a conjunction of 

• all the modified clauses from all formulas (pi,j'i 

• all clauses C(zq, . . . , z{) of <f> such that R does not contain a tuple where 
zq, Zi, . . . , Zi all get the same value; 

• the formula C(zo, . . . , z{) V (zq = Z\ = ■ ■ ■ = z\) for all other clauses C of <f> 
with variables Zq,Z\, . . . ,z%. 

Obviously, ip is 11-Horn. We have to verify that ip defines R. Let t be an n-tuple 
such that t ^ R. If t is injective, then some clause C{zq,Z\,...,z{) of <fi is not 
satisfied by t. The variables zq,z\, . . . ,zi of C cannot all have the same value in t, 
and so if 1S not satisfied either. If there are i,j such that t[i] = t[j] then the tuple 
tj = (t[l], . . . ,t[j — 1], t[j + 1], . . . , t[n]) ^ R iy j. Therefore some clause C of faj is not 
satisfied by tj, and C V Xi 7^ Xj is not satisfied by t. Thus, in this case t does not 
satisfy ijj, too. 

We also have to verify that all t € R satisfy ip. Let C be a conjunct of ip created 
from some clause in cf>ij. If t[i] ^ t[j], then C is satisfied by t because C contains 
Xi 7^ Xj. If t[i] — t[j], then (t[l], . . . ,t[j — l],t[j + 1], . . . ,t[n]) € Rij and thus this 
tuple satisfies <f>i t j. This also implies that t satisfies C. 

Finally, let C be a conjunct of ip created from some clause of 4>- Then C is of the 
form x Ua > x Ul V • • • V x Uo > x Um or of the form x UQ > x Ul V • • • V x Uo > x Um V (x« = 
x Ul — ■ ■ ■ = x Um ). If t is constant on the variables of C, then, by construction of 
ip, C contains the disjunct x Uo = x Ul = ■ ■ ■ = x Um and is satisfied. So suppose t 
is not constant on the variables of C. Assume for contradiction that t[uo] < t[ui] 
for all i 6 [rn]. Since t is not constant, there is a j € [to] such that t[uo] < t[uj}. 
By our assumptions on (j>, there is an injective tuple t' € R that satisfies only the 
literal x Uo > x u . in C. Consider the tuple t" = lex(t,t'). Because t' has pairwise 
distinct entries and lex is an injective operation, t" also has pairwise distinct entries. 
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Figure 10.7. Illustration of an operation / generated by 11 that 
satisfies f(x,y) = af(f3(x),P(y)). 



Since t[u ] < t[m} for all i 6 [to], t'{uo] < t'[ui] for all i G [to] \ {j}, and because lex 
preserves <, we get that t"[u ] < t"[ui] for all i G ["i]\{j'}- Finally, since t[u ] < t[uj] 
we also have that t"[u ] < t"[uj] by the properties of lex. Hence, t"[u ] < t"[ui] for 
all i G [to]. Therefore t" does not satisfy C and thus also does not satisfy <fi. But t" 
is injective and is from R (because R is 11-closed), in contradiction to the properties 
oicjy. □ 

10.4.4. Commutativity Modulo Automorphisms. For a uniform presen- 
tation of the tractability results in Section |10.6[ we need a different description of 
11-closed temporal constraint languages. Partition the rationals Q = Qi W Qi into two 
dense subsets Q\ and Q2, and let g be any injective binary operation that preserves 
< and < such that 

• for x G Qi, we have that f(x,yi) > f(y2,x) for all 2/1,2/2 > x; 

• for x G Q2, we have that f(x, yi) < /(j/2, x) for all j/i, 2/2 > x. 

Proposition 10.4.9. There are automorphisms a,f3 o/(Q;<) and an operation 
f generated by g such that for f(x, y) — af(/3y, (3x) holds for all x, y G Q. 

Proof. By Lemma 3.1. 8| it suffices to show that for all finite subsets S of Q 
there exist automorphisms a' , /3' of (Q; <) such that g{x,y) = a'g((3'y, f3'x) for all 
x, y G S. Let {s\, . . . , s n } = S with si < • • • < s n . By adding extra elements to S 
we can assume without loss of generality that n is even, S2i-i G Qi, and S2i G Q2 for 
1 < i < n/2. Then the strict linear order induced by g((3'y, (3'x) on S 2 is the same as 
the strict linear order induced by g(x, y), and hence there exists an automorphism a' 
that maps g{j3'y, (3'x) to g(x, y) for all x,y G S. □ 

For an illustration of /, see Figure |10.7| The red vertices are the elements of 
{(x,x) I x G Qi}, and the blue blue vertices the elements of {(x,x) \ x G Q2}- 

Proposition 10.4.10. The operation f defined above is generated by 11, and gen- 
erates II. 



Proof. It is easy to see that / interpolates 11. For the converse, it suffices to 
verify that / preserves all 11-Horn formulas, by Proposition 10.4.8 Since g is injective 
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6.1.5) 



for every j £ [k] . It is 



and / is generated by g and depends on both arguments, ii is also injective; therefore, 
it suffices to show that / preserves formulas of the form 

(z > zi) V • • • V Oo > zi) 

and formulas of the form 

(zq > z{) V • • • V (z > Zi) V (zo = Zi = ■ ■ ■ = zi) . 

Preservation of formulas of the latter type reduces to the former type, since / preserves 
< and is injective binary. Now suppose that a = (do, ai, . . . , a{) and b = (bo, b%, . . . , b{) 
are two tuples that satisfy zq > z± V • • • V zq > z\. Assume that clq < bo. Let i be 
such that dj = min(ai, . . . , ai). Then ao > di, and we get that f(ao, bo) > f(ao, clq) > 
f(di,ai) > f(ai,bi). Therefore, f(a ,b ) > min(f(ai,b 1 ),...,f(ai,bi)). We can 
argue analogously in the case that a > b . □ 

10.4.5. An Algorithm for 11-closed Constraints. In this section we present 
an algorithm for 11-closed constraints. One of the underlying ideas of the algorithm is 
to use a subroutine that tries to find a solution where every variable has a different 
value. If this is impossible, the subroutine must return a set of at least two variables 
that denote the same value in all solutions - since the constraints are preserved by a 
binary injective operation, such a set must exist (Proposition 

The i-th entry in a /c-tuple t is called minimal if t[i] < t[j 
called strictly minimal if t[i] < t[j] for every j 6 [k] \ {i}. 

Definition 10.4.11. Let R be a k-ary temporal relation. A set S C [k] is called 
a min-set for the i-th entry in R if there exists a tuple t G R such that the i-th entry 
is minimal in t, and for all j G [k] it holds that j £ S if and only if t[i] — t[j]. We 
say that t is a witness for this min-set. 

Let R be a fc-ary relation that is preserved by lex (recall that 11-closed constraints 
are preserved by lex as well), and suppose that the i-th entry has the min-sets 
Si,..., Si, for I > 1, with the corresponding witnesses t\,...,ti. Consider the tu- 
ple t :— lex(fi , lex(t 2 , ■ . ■ lex(ti—i,ti))). Since the entry i is minimal in every tuple 
ti,...,ti, and since lex preserves both < and <, it is also minimal in t. Because lex 
is injective, we have that t[i] = t[j] if and only if these two entries are equal in each 
tuple ti, . . . , t%. Hence, the min-set for the i-th entry in R witnessed by the tuple t is 
a subset of every other min-set Si, ■ . ■ , Si. We then call this set the minimal min-set 
for the i-th entry in R. 

Lemma 10.4.12. Let R be a k-ary relation preserved by lex, and let S be the 
minimal min-set for the i-th entry in R. If t £ R is such that t[j] > t[i] for every 
j £ S, then t[i] = t[j] for every j £ S . 

PROOF. Let t' £ R be the tuple that witnesses the minimal min-set S. Let t £ R 
be such that not all entries in S are equal (in particular, |5| > 1). Consider the tuple 
,s := lex(t',t). By the properties of lex it holds that s[i] < s[j] for every j £ [k] \ S. 
Furthermore, s[i] < s\j] for j £ S if and only if t[i] < t[j]. Thus, unless s witnesses a 
smaller min-set for i in R (which would be a contradiction), we have that s[i] > s\j] 
for some j £ S. □ 

To develop our algorithm, we use a specific notion of constraint graph of a tem- 
poral CSP instance, defined as follows. 

Definition 10.4.13. The constraint graph of a temporal CSP instance cf> is 
a directed graph (V; E) defined on the variables V of <p. For each constraint of the 
form R(xi, . . . , Xk) from <f> we add a directed edge (xi, Xj) to E if in every tuple from 
R where the i-th entry is minimal, the j-th entry is minimal as well. 
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Definition 10.4.14. If an instance of a temporal CSP contains a constraint 4> 
imposed on y such that 4> does not admit a solution where y denotes the minimal 
value, the we say that y is blocked (by </>) . 

We can easily determine for each constraint which variables are blocked by this 
constraint: For a constraint represented by weak linear orders we just check all weak 
linear orders and build a set of variables that are not minimal in any of them. For a 
constraint represented by an 11-Horn formula, a variable Xi is blocked if and only if the 
formula is of the form Xi > z\ V • • • V Xi > z\. Thus, by inspecting all the constraints 
it is possible to compute the blocked variables in linear time in the input size. We 
want to use the constraint graph to identify variables that have to denote the same 
value in all solutions, and therefore introduce the following concepts. 

Definition 10.4.15. A strongly connected component if of the constraint graph 
G^ for a temporal CSP instance <j> is called a sink component if no edge in G^ leaves 
if , and no variable in if is blocked. A vertex of G that belongs to a sink component 
of size one is called a sink. 

Lemma 10.4.16. Let 03 be a lex-closed temporal constraint language. Let <f> be 
an instance of CSP(03) with variables V , and let K C V be a sink component of the 
graph Gj,. Then in every solution of (f> all variables from if must have equal values. 

Proof. We assume that <f> has a solution s: V — > Q, and that if has at least 
two vertices; otherwise the statement is trivial. Define M := {x G if \ s(x) < 
s(y) for all y € if}. We want to show that M = if. Otherwise, because if is a 
strongly connected component, there is an edge in G^ from some vertex u £ M to 
some vertex v £ K\M. By the definition of G^, there is a constraint rp in <f> such 
that whenever u denotes the minimal value of a solution of tp, then v has to denote 
the minimal value as well. By permuting arguments, we can assume without loss of 
generality that tp is of the form R(wi, . . . , Wk) where w\ = u. Because if is a sink 
component, the variable u cannot be blocked, and hence there is a minimal min-set 
S for the first entry in R. 

Note that G$ contains an edge from u to wt for all i £ S. Since if is a strongly 
connected component, all these variables Wi are in if. Because s{u) < s(y) for all 
y £ if, there is no variable mj, i £ S, such that s(wi) < s(u). This contradicts 



Lemma 10.4.12 because s(u) ^ s(v). □ 

Lemma 1 1 . 4. 1 6| immediately implies that we can add constraints of the type x = y 
for all variables x, y from the same sink component if. Equivalently, we can consider 
the CSP instance where all the variables in if are contracted, i.e., where all variables 
from if are replaced by the same variable. When <f) = 3xi, . . . , x n (tf>\ A • • • A 4> m ) is an 
instance of a CSP(23), and Xi £ V :— {xi, . . . , x n }, then we write <j>[V \ {xi}] for the 
formula 3x±, . . . ,Xi-i,Xi+i, . . . , x n ((3xi.<f)i) A- • ■ A(3xi.<p m )) . Note that if 03 contains 
all primitive positive definable relations whose arity is bounded by the maximal arity 
of of the relations in 03, then <f>\V \ {xi}} can be viewed as an instance of CSP(03). 

Lemma 10.4.17. Let 03 be an ll-closed temporal constraint language. Let cf> be an 
instance o/CSP(03) with variables V, and let x be a sink in G^. If <p\V \ {x}] has an 
infective solution, then has an infective solution as well. 

Proof. Let s: V — > Q be an injective solution to (f>[V\ {x}]. We claim that any 
extension r of s to x such that r(x) < s(y) for all y £ V \ {x} is injective and satisfies 
<fi. If x appears in no constraint in <fi, the statement is trivial. Consider a constraint 
tp — R{x\, . . . , xu) from <f> that is imposed on x, and let S C [k] be such that i £ S if 
and only if x = Xi. By the definition of <p[V \ {x}], the mapping s has an extension 
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Spec((j>) 

II Input: An instance <j> of CSP(25) with variables V . 

II Output: If algorithm returns false then <j> has no solution. 

// If (j> has an injective solution, then algorithm returns true. 

II Otherwise return S C V, \S\ > 2, such that 

// for all x,y £ S we have x — y in all solutions to <f>. 

Set Y := 

While G,j contains a sink s 
Y := Y U {s} 

If Y = X then return true 
else := \ Y] 

if G</> has sink component S return S 
else return false 
end if 



FIGURE 10.8. A polynomial-time algorithm for CSP(Q3) when 25 is 
11-closed: the sub-procedure Spec. 



Solve ($) 

/ / Input: An instance $ with variables V. 

1 1 Output: accept if $ is true, reject otherwise. 

S := Spec(<I>) 

If S = false then reject 

else if S = true then accept 

else 

Let $' be contraction of S in $. 
Return Solve($')- 
end if 



FIGURE 10.9. A polynomial-time algorithm for CSP(Q5) when 58 is 
11-closed: the main procedure. 



s' that is also defined on x such that (s'(xi), . . . ,s'(xk)) € R. Because a; is a sink, 
there is tuple t € R such that S is the minimal min-set for the i-th entry of R for 
each i £ S. Let t' be the tuple (s'(xi), . . . , s'(xk)), and let a £ Aut((Q; <)) be such 
that a(s'(x)) = 0. Then r := ll(at' ,t) £ R. Note that for i,j £ [k] \ S, we have that 
r [i\ < r [j] if an d only if < r{xj). Hence, r satisfies all constraints from (f>, which 
is what we had to show. □ 

Our algorithm for 11-closed constraints can be found in Figure |10.9| we are now 
ready to prove its correctness. 



Theorem 10.4.18. The procedure Solved) in Algorithm 10.9 decides whether a 
given set of ll-closed constraints $ has a solution. There is an implementation of the 
algorithm that runs in time 0(nm), where n is the number of variables of $ and m 
is the size of the input. 

PROOF. The correctness of the procedure Spec immediately implies the correct- 
ness of the procedure Solve. In the procedure Spec, after iterated deletion of sinks in 
G^, we have to distinguish three cases. 

In the first case, Y — X. In this case it follows by a straightforward induction 
from Lemma [10.4. 17| that <fi has an injective solution. Otherwise, in the second case, 
Gtf, contains a sink component S with |5| > 2. We claim that for all variables x,y £ S 
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we have x = y in all solutions to </>. Lemma 10.4.16 applied to <f)[X \ Y] implies that 
all variables in the same sink component must have the same value in every solution, 
and hence the output is correct in this case as well. 

In the third case, F / I, but G<j, does not contain a sink component. Note 
that in every solution to <f> some variable must take the minimal value. However, 
since each strongly connected component without outgoing edges contains a blocked 
vertex, there is no variable that can denote the minimal element, and hence <f> has no 
solution. Because (j> is at all times of the execution of the algorithm implied by the 
original input constraints, so the algorithm correctly rejects. 

Since in each recursive call of Solve the instance in the argument has at least 
one variable less, Solve is executed at most n times. It is not difficult to implement 
the algorithm such that the total running time is cubic in the input size. However, 
it is possible to implicitly represent the constraint graph and to implement all sub- 
procedures such that the total running time is in 0(nm); for the details, we refer 
to [39]. □ 

10.5. Shuffle-closed Constraints 

An important subclass of temporal constraint languages are shuffle closed con- 
straint languages. As we will see, there are NP-complcte shuffle-closed constraint 
languages. However, in this section we present three additional restrictions for shuffle- 
closed constraint languages that imply that the corresponding CSPs can be solved in 
polynomial time. 

10.5.1. Shuffle closure. We define shuffle closure, and show how shuffle closure 
can also be described by a certain binary operation on Q. 

Definition 10.5.1. A k-ary relation R is called shuffle closed iff for all tuples 
ti, ti £ R and all indices I £ [k] there is a tuple £3 £ R such that for all i,j £ [k] we 
have t 3 [i] < t 3 [j] iff 

• ti[i\ < ti[l] and ti[i] < ti[j], or 

• ti[l] < ti[{\, tx[l] < hlj], and t 2 [i] < t 2 \j]. 

Let pp be an arbitrary binary operation on Q such that pp(a,b) < pp(a',b') iff 
one of the following cases applies: 

• a < and a < a' 

• < a, < a', and b < b' . 

Clearly, such an operation exists. For an illustration, see the left diagram in Fig- 
ure 10.10| In diagrams for binary operations / as in Figure 10.10 we draw a directed 
edge from (a, b) to (a',6') if f(a,b) < f(a',b'). Unoriented lines in rows and columns 
of picture for an operation / relate pairs of values that get the same value under /. 
The right diagram of Figure 1 10. 10| is an illustration of the dual-pp operation. The 
name of the operation pp is derived from the word 'projection-projection ', since the 
operation behaves as a projection to the first argument for negative first argument, 
and a projection to the second argument for positive first argument. 

Proposition 10.5.2. A temporal relation is shuffle- closed if and only if it is 
preserved by pp. 

Proof. Let R be a shuffle-closed relation, and let t\ and t 2 be tuples from R. 
We want to show that £ 3 = pp(ti,t 2 ) £ R. If t\ only contains positive values, then 
there clearly exists an a £ Aut((Q; <)) such that t 3 = at 2 , and since R is preserved 
by the automorphisms of (Q; <), we are done. Otherwise, let I £ [k] be an index such 
that ti[l] is the largest entry in t\ that is not positive. Because R is shuffle-closed, 
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Figure 10.10. A visualization of (left) and dual-pp (right). 



we know that there exists a tuple t' 3 £ R such that t' 3 [i] < t' 3 [j] iff < ti\l] and 
iiW < or (ti [Z] < h[i], ti[l] < ti\j], and t 2 [i\ < t 2 [j\) for all ij £ [fe]. By 

the definition of pp, and the choice of I, the tuple t 3 satisfies the same property, and 
therefore there exists (3 £ Aut((Q; <)) such that t 3 = (3t' 3 , and hence t 3 £ R. 

For the opposite direction, we assume that R is preserved by pp, and have to show 
shuffle closure of R. Let t\, t 2 be tuples in R, and let I £ [k]. Choose 7 £ Aut((Q; <)) 
such that 7 maps t\ [I] to 0. Then t 3 = pp("fti,t 2 ) is a tuple that satisfies the conditions 
specified in the definition of shuffle-closure. □ 

Due to Proposition 1 10. 5.2} we use the phrase '03 is shuffle-closed' interchangeably 
with '*B is preserved by pp J . The following lemma states an important property of 
shuffle-closed languages that will be used several times in the next subsections. 

Lemma 10.5.3. Let ti, . . . } i; be tuples from a k-ary shuffle- closed relation R, and 
let Mi, . . . , Mi C [k] be disjoint sets of indices such that (J i=1 Mi — [k] and such that 
for all i,j £ [I] with i < j and for all i' £ Mi, j' £ Mj it holds that U[i'] < ti[j'\. 
Then there is a tuple t £ R such that 

• t[i'] < t\j'\ for all i,j £ [I] with i < j and for all i' £ Mi,]' £ M d ; 

• t[i'] < t[i") iffti[i'] < U[i"} for all i £ [I] and all £ Mi. 

Proof. Let . . . , £ Aut((Q; <)) be such that ft maps max{ti[i']\i' £ Mi} 
to 0. We set 

t := pp((3it 1 ,pp(f3 2 t 2 , ■ ■ .,pp(f3i- 1 ti- 1 ,ti) ...)). 
The tuple t clearly belongs to R. 

We prove by induction on / that t satisfies the other conditions of the lemma. 
Observe that f3\ maps all the entries of t\ at M\ to non-positive values. Thus for 
I = 2, it is easy to check from the properties of pp that for each i £ Mi and i' £ M 2 
we have t[i) < t[i'] as required by the statement of the lemma. Also the second 
condition is immediate. For I > 2 let t' be defined by 

f ■= PP((3 2 t 2 ,PP{P3t 3 , ■ ■ ■ iMA-iiii'i) ■ ■ •)) • 

Then we have t = pp(0xt±, t'). Now we apply the same argument as for / = 2. Because 
the order on [fc]\Mi is preserved by the application of pp, we know that the conditions 
are satisfied for the sets M 2 , . . . , Mj. The argument also shows that the entries at Mi 
are smaller than the entries at [k] \ Mi and that their order is the same as in t\. □ 

The following lemma is a simple criterion for showing that certain operations 
generate pp. 

Lemma 10.5.4. Let f be a binary operation preserving < such that for some 
a,/3£ Aut((Q; <)) we have f(x, y) — ax for all x < — 1, < y < 1, and f(x, y) = j3y 
for all x > 1, < y < 1. Then f generates pp. 
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Figure 10.11. Illustration of the operation min. 



Proof. It suffices to show that every relation preserved by / is also preserved by 
pp. Let R be preserved by /, and let t\, t<z be two tuples from R. Let 71 £ Aut((Q; <)) 
be such that 72; = x + 1 for all positive entries x of £2 and 71 x = x — 1 for all other 
entries x of £2- Let 72 <E Aut((Q; <)) be such that all entries of 72i2 are larger than 
and smaller than 1. Then /(7iti,7at2) is in the same orbit as pp(ti,t2), which is 
what we wanted to show. □ 



It is easy to verify that the relation T3 , defined in Section 10.2.3 is shuffle-closed 



Proposition 10.2.7 shows that CSP((Q; S)) is NP-complete, and thus the property of 



shuffle-closure is not strong enough to guarantee tractability. 

10.5.2. Min-union closure. This section introduces and studies a stronger 
property than shuffle-closure, namely preservation under the binary operation min 



that maps two values x and y to the smaller of the two values; see Figure 10.11 for an 
illustration of the operation min. We also present a sufficient condition that implies 
that a temporal constraint language is preserved by min. 

For constraint languages over a finite domain, min- and max-closed relations 



were studied in 119 . An equivalent clausal description of such constraints is known; 
however, the equivalence only holds for finite domains. The tractability of the CSP 
where the constraint language has such a clausal description has also been shown for 
infinite domains 



72 



But the algorithm presented in |72| cannot be applied to all 
min-closed constraint languages over an infinite domain; it is already not clear how 
to adapt this approach to deal with the relation {(x, y,z) \ x > y V x > z}, which is 
preserved by min. In Section 10.5.6 we describe an algorithm that efficiently solves 
the CSP for temporal constraint languages that are preserved by min. 

Definition 10.5.5. Lett be from Q k . The set of indices 
{i £ [k] I t[i] < t[j] for all j e [k]} 
is called the min-set of t, and denoted by M(t). 

Definition 10.5.6. A relation is called min-union closed if for all tuples t\, t^ 
in R there exists a tuple t 3 in R such that M(t 3 ) = M(t\) U Ajf(t 2 ). 

We now want to link min-union closure of the relations in the constraint language 
to the existence of certain polymorphisms. 

Definition 10.5.7. Let f be a binary operation preserving <. We say that f 
provides min-union closure if f (0,0) = /(0, x) = f(x,0) for all integers x > 0. 

The operation min is an example of an operation providing min-union closure. 
The following lemma connects Definition |10.5.6| and Definition |10.5.7| 

Lemma 10.5.8. Let R be a temporal relation preserved by an operation f providing 
min-union closure. Then R is min-union closed. 
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PROOF. Let t\ and ti be tuples in R, and let a\ and <Z2 be the minimal values 
among the entries of t\ and t 2 , respectively. Then there are a\,a 2 € Aut((Q; <)) such 
that a\a\ — 0.20,2 — 0, and such that ot\ and 0.2 map all other entries of t\ and t 2 
to integers. Observe that all entries at M(ti) UM(i 2 ) in the tuple t 3 — f{a\t\, 02*2) 
have the same value. Because / preserves <, this value is strictly smaller than the 
values at all other entries in t 3 . Hence, M(t 3 ) = M{t\) U M(t 2 ). □ 

The following proposition implies that {/, pp} generates min for every operation 
/ that provides min-union closure. 

Proposition 10.5.9. A temporal relation R is preserved by pp and an operation 
providing min-union closure if and only if R is preserved by min. 

Proof. Clearly, min provides min-union closure. Also observe that min satisfies 



the conditions of Lemma 10.5.4 and hence generates pp. 

For the opposite direction, suppose that R is fc-ary and preserved by pp and an 
operation / providing min-union closure. We show that for any two tuples ti,t 2 € R 
the tuple t 3 — min(ti, t 2 ) is in R as well. Let I be the number of distinct values in t 3 
and vi < V2 < ■ ■ ■ < i?j be these values. We define Mj, i G [I], to be the set of indices 
of £3 with the i-th lowest value, i.e., Mj = {j e [k] | t 3 [j] = Vi}. 

Now let ati, . . . ,ai € Aut((Q; <)) be such that cuvi = and such that the en- 
tries of a(t\ and a^2 are integers. Using these automorphisms we define the tuples 
si, . . . , si by Si — f(a>iti,a.it2)- Clearly, these tuples belong to R. It also holds that 
Si is constant at Mi because for each j £ Mi at least one of the entries ti [j] , *2 [j] is 
equal to Vi (the other one can be only greater) which is subsequently mapped to 
by on and / maps all such pairs to the same value. Furthermore, for each j' € Mj/ 
for i < i' < I we have that Si[j'] is greater than the value of Sj at Mj, because 
m!n(f 1 [j'],i 2 [j']) = Vi' is greater than Vi and / preserves <. 

Now we can apply Lemma |l0.5.3| to the obtained tuples s±, . . . , si and the corre- 
sponding sets Mi , . . . , Mi . The lemma gives us some tuple t 3 from R which is constant 
at each set Mi, i < [I], and such that for each i < j < I the value of t' 3 at M, is lower 
than the value of t 3 at Mj. Thus t' 3 has the same order of entries as £3 which shows 
that £3 is in R as well. □ 

10.5.3. Min-intersection closure. In this section, we study a different restric- 
tion of shuffle-closed constraint languages. 

Definition 10.5.10. A relation R is called min-intersection closed if for all tuples 
ti, t% in R, if M(t\) n Mfa) 7^ 0, then there exists a tuple t 3 in R such that M{t 3 ) — 
M{t x )nM(t 2 ). 

Definition 10.5.11. Let f be a binary operation preserving <. We say that f 
provides min-intersection closure if f (0,0) < f(0,x) and f(0,0) < f(x,0) for all 
integers x > 0. 

Lemma 10.5.12. Let R be a temporal relation that is preserved by an operation f 
that provides min-intersection closure. Then R is min-intersection closed. 

Proof. Let t\ and t 2 be two tuples in R such that M(t\) D M(t 2 ) is non-empty, 
that is, it contains an index i. Choose a\,a% € Aut((Q; <)) such that = 
0^2 [«] = 0, and such that a>i and 02 map all other entries of t\ and t 2 to integers. 
Consider the tuple t 3 = f{a\ti, Cfztz). Because at the entries from M(t\) (from M(t 2 )) 
the tuple a\t\ (a 2 f 2) equals 0, and because /(0,0) < f(0,x) and /(0,0) < f(x,0) 
for all positive integers x, it follows that in t 3 all entries at M(ti) n M(t 2 ) have 
a strictly smaller value than all values at the symmetric difference M(t\) A M(t 2 ). 
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Figure 10.12. Illustration of the operation mi. 



Because / preserves <, it also follows that all entries at M(t\) n Mfa) have a strictly 
smaller value than the entries not at M{t\) U Mfe)- We conclude that M{tz) = 
M(ti)r\M(t 2 ). □ 

An example of an operation that provides min-intersection closure is the operation 
mi, defined by 

!a(x) if x < y 
b(x) if x = y 
c(y) if x > y 

where a, b, c are unary operations that preserve < such that 

b(x) < c(x) < a(x) < b(x + e) 

for all x G Q and all < e G Q (see Figure 10.12). Operations a,b,c with these 

be an enumeration of Q. 

, q n } such that 



Clearly, this is 



properties can be constructed as follows. Let gi,(?2, 
Inductively assume that we have already defined a,b,c on {q-i, 
Kli) < c (li) < a (lt) < Hlj) whenever % < q j} for i,j E [n 
possible for n = 1. If q n +i > 1% for ah i £ [n], let qj be the maximum of {q%, . . . , q n }, 
and define a{qj) < b(q n +\) < c(q n+ i) < a(q n +i). In the case that q n +i < qi for all 
i € [n] we proceed analogously. Otherwise, let i, j G [n] such that qi is the largest 
possible and qj is smallest possible such that qi < q n +i < qj- In this case, define 
a(qi) < b(q n+ i) < c(q n+1 ) < a(q n+ i) < b(qj). In this way we define unary operations 
a, b, c on all of Q with the desired properties. 

In fact, the operation mi will be of special importance, because the following 
proposition shows that pp together with any operation providing min-intersection 
closure generates the operation mi. 

Proposition 10.5.13. A temporal relation R is preserved by pp and an operation 
f providing min-intersection closure if and only if R is preserved by mi. 



Proof. It is clear that mi provides min-intersection closure, and Lemma [l0.5.4| 
shows that mi generates pp. 

For the opposite direction, suppose R is /c-ary and preserved by pp and an opera- 
tion / providing min-intersection closure. We show that for any two tuples t\,t2 G R 
the tuple £3 = mi(ii, £2) is in R as well. Let a, b, c be the mappings from the definition 
of the operation mi. Let v\ < • • • < vi be the minimal- length sequence of rational 
numbers such that for each i' G [k] it holds that t 3 [i'] G {Jj e ^{a(vj), b(vj), c(vj)}. 
Let M l be 

{i' G [k] I t 3 [i'\ G {o(«i),6(«i).c(«i)}} ■ 
Observe that for each i' G Mi at least one of t\[i'] and t2[i'} is equal to Vi and the 
other value is greater or equal to Vi. Let Mf be the set of those i' G Mi where 
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Vi = tx[i'] < t 2 [i'], M\ the set of those i! £ M, where v. t = tt[i'] = t 2 [i% and Mf the 
set of those i' £ Mi where Vi = t 2 [i'] < t\ [i'] . 

Let cti, . . . ,ai £ Aut((Q; <)) be such that a« maps v t to and such that the 
entries of a^ti and afa are integers. Let (3 £ Aut((Q; <)) be such that /3/(0, 0) = 0. 
For each i £ [I] we define 



Si := pp{f3f{aiti 1 ait 2 )),pp{ait 2 ,t 1 ) . (16) 

We verify that for all i £ [I] the tuple Sj is constant on each of the sets Mf, Mf, Mf, 
the value at Mf is lower than the value at Mf which is lower than the value at 
Mf. Furthermore, for each j £ [l],j > i, and each i' £ Mi,f £ Mj, it holds that 



Si[i'] < Si\j']. Having this, we can apply Lemma 10.5.3 and obtain a tuple from R 
with the same ordering of entries as in t 3 , which proves the lemma. 

Because on maps Vi to 0, the properties of pp imply that the tuple t[ = pp(oiit 2 , £1) 
is constant at Mf U Mf and at Mf , and the value at the first set is smaller than the 
value at the second set. Because the values of t 2 at A/f U Ui=j+i Mj are greater than 
Vi and the values of t\ at Uj_i+i Mj are also greater than Vi (recall that for each 
j £ [I], j' £ Mj it holds that min(ti\j'],t 2 [j']) — Vj) we conclude that the values of 
at Uj=i+i Mj are greater than those at Mj. 



The application of / in ( 16 ) yields a tuple which is constant on M 8 - and its 



value there (which is consequently mapped to by /3) is smaller than the values at 
Mf U Mf U U 7=1+1 Mj. Thus it is easy to verify from the properties of pp that the 



outer application of pp in ( 16 ) yields a tuple with the desired properties. □ 



Example 10.5.14. An interesting example of a relation that is preserved by mi 
but not by min is the 4-ary relation / defined as follows. 

I (a, b,c,d) =(a = bAb<cAc = d) 
\] {a = bhb> cAc = d) 

V (a = b Ab < c A c < d) 

V (a> bAb> cAc = d) 

To see that I is preserved by mi, let t x and t 2 be two tuples from I. We have to 
show that i3 := mi(ti,t 2 ) £ I. First note that I(a,b,c,d) is equivalent to 

(a > b) A (b ^ c) A (c < d) A (a = b V b > c) A (b < c A c = d) , 

and that mi preserves < and 7^. 

We distinguish the following cases. 

(1) ti[2] < £i[3] and t 2 [2] < 4 2 [3]. Then i x [l] = t x [2] and t 2 [l] = t 2 [2], and hence 
*3[1] = *s[2]. Since mi preserves <, we have £3 [2] < £3 [3]. Since mi preserves 
<, we have that i 3 [3] < t 3 [4], and hence t 3 [l] = t 3 [2] < t 3 [3] < i 3 [4] or 
*3[1] = *3p] < *3[3] = i 3 [4], which proves the claim. 

(2) ti[2] < ti[3] and t 2 [2] > t 2 [3]. Then i x [l] = t x [2] and t 2 [3] = t 2 [A\. We verify 
that £3 satisfies the equivalent characterization of / given above; since mi 
preserves < and 7^, this amounts to proving that £3 satisfies the two clauses 
(a = b V b > c) A (6 < c A c = d). 

The first sub-case we consider is £3 [2] < f 3 [3] . Then by the assumptions 
on £1 and t 2 and by definition of mi we have that ti[2] < £2(2]. Therefore, 
£i[l] = <i[2] < t 2 [2] < t 2 [l] and thus £ 3 [1] = t 3 [2] again by the properties 
of mi; we see that both clauses are satisfied. The second sub-case is that 
£3 [2] > f 3 [3]. Then by the assumptions on t\ and t 2 and by definition of mi 
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we have that ii[4] > ii[3] > i 2 [3] = ^[4]. Thus t 3 [3] = i 3 [4] and again both 
clauses are satisfied. 

(3) t\[2] > £i [3] and i 2 [2] > i 2 [3]. This is analogous to the first case. 

(4) ti[2] > £i [3] and i 2 [2] < i 2 [3]. This is analogous to the second case. 

The relation 7 is not preserved by min since (0, 0, 1, 2) € J and (2, 1, 0, 0) £ 7 but 
mm((0, 0, 1, 2), (2, 1, 0, 0)) = (0, 0, 0, 0) £ 7. □ 

Example 10.5.15. The following ternary temporal relation J7 is preserved by mm 
(we omit the easy proof), but not preserved by mi. 



U(x,y,z) = {x = y Ay < z) 

V (x = z A z < y) 

V (x = yAy = z) 

To see that U is not preserved by mi, note that the tuple mi((0, 0, 1), (0, 1, 0)) 
has three distinct values and hence is not in U, but (0, 0, 1), (0, 1, 0) € t/. An al- 
gorithm that solves temporal constraint languages preserved by mi can be found in 
Section [10331 □ 

We later want to describe the tractable temporal CSPs uniformly by the existence 
of polymorphisms satisfying universal conjunctive sentences. To do so, we need the 
following alternative description of temporal constraint languages preserved by mi. 

Let Q = Qi W Qi be a partition of the rational numbers into two dense subsets, 
and define 

ai(x) if x = y G Qi 
a 2 (x) if x < y and x £ Q\ 
a 3 (y) if x > y and y € Qi 
a 4 (y) if a; = y € Q 2 
a 5 (y) if a > y and y £ Qi 
a&{y) if x <y and x £ Q2 



where a\, . . 
< e G Q 



, as are unary operations that preserve < such that for all x £ Q and all 



ai (a;) < ■ • • < a§ (x) < a± (x + e) . 

Operations oi, . . . , a& with these properties can be constructed similarly as demon- 
strated for mi in Section [10.5.3 For an illustration of g, see Figurc |l"0.13| Red vertices 
are from {(x,x) | x £ Qi}, blue vertices from {(x,x) | x £ Q 2 }. 

We use the binary operation g to construct a binary operation that is commutative 
modulo applying automorphisms of (Q; <) to arguments and function values. 

Proposition 10.5.16. There are automorphisms a,/3 o/(Q; <) and an operation 
f generated by g such that for f(x, y) = af(j3y, fix) holds for all x, y £ Q. 

Proof. By Lemma [3.1.8| it suffices to show that for all finite subsets S of Q 
there exist automorphisms a',f3' of (Q; <) such that g{x,y) = a' g((3'y, f3'x) for all 
x, y £ S. Let {si, . . . , s n } = S with si < • • • < s n . By adding extra elements to S 
we can assume without loss of generality that n is even, s 2 i_i £ Q\, and s 2 i £ Q 2 
for 1 < i < n/2. Then choose /3' to be an automorphism of (Q; <) that sends to 
Si + i, for all i < n. The weak linear order induced by g{/3'y, f3'x) on S 2 is the same as 
the weak linear order induced by g(x,y), and hence there exists an automorphism a' 
that maps g((3'y, /3'x) to g(x, y) for all x,y £ S. □ 



Proposition 10.5.17. 
equivalent. 



Let R be a temporal relation. Then the following are 
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Figure 10.13. Illustration of the operation g. 



(1) R is preserved by the operation f constructed above. 

(2) R is preserved by mi. 

(3) R can be defined by a conjunction of formulas of the form 

0^iei)V .--V {z^x k ) 

V {z>y x )V ■■■V (z>yi) (17) 

V (z > yo) ■ 



PROOF. The implication from (1) to (2) follows from Proposition 10.5.13 since / 
clearly interpolates pp and provides min-intersection closure. Alternatively, observe 
that when e is an endomorphism of (Q; <) whose image is contained in Q 2l then 
f(e(x), e(y)) induces the same weak linear order on Q 2 as mi. Hence, / generates mi. 



The implication from (2) to (3) is an unpublished result from 191 



For the implication from (3) to (1) we verify that / preserves formulas of the form 



as in ( |17| ). Let h := (c, ai, . . . ,a k ,b ,bi, ■ ■ ■ M) and t 2 := (c',ai, . . . ,a' k ,bb,b[, ...,&;) 
be two tuples that satisfy (f)(z, x%, . . . , x k , yo, t/i, . . • , yi)- Suppose for contradiction 
that t := f(ti,t 2 ) = (c", a'{, . . . , <, b' \ b'(, b'{) does not satisfy 0. Then c" = 
a" = ■■■ = a'l. Since f(ux,vi) — fiui^v-i) = f (113^3) implies that either u\ = u 2 = 
U3 or vx = V2 = i>3, and by symmetry in the definition of / we can assume that 
c = ai = • • • = afe. 

It is then impossible that c = 60, since this would imply that c" = b' ' by the 
properties of g and /, contrary to the assumption that ->(j)(to). So there must be an 
i G {0, . . . , k} so that c > b t . Again by the properties of / we find that f(bi, b[) < 
/(c, c'), and we have found a contradiction to ->^>(to). □ 

10.5.4. Min-xor closure. We now introduce the last of the mentioned closure 
conditions. 

Definition 10.5.18. A relation is called min-xor closed if for all tuples t\, t 2 in 
R where the symmetric difference M(t\) A M(t 2 ) is nonempty there exists a tuple t^ 
in R such that M(t 3 ) = M(h) A M(t 2 ). 

Definition 10.5.19. Let f be a binary operation preserving <. We say that f 
provides min-xor closure if f (0,0) > f(0,x) — f(y,0) for all integers x,y > 0. 
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Figure 10.14. Illustration of the operation mx. 



For an example of a binary operation that provides min-xor closure, consider the 
following binary operation, which we denote by mx. 



mx(x, y) 



a(min(x, y)) if x ^ y 
b(x) if x = y 

where b and b are unary operations that preserve < such that a(x) < b{x) < a(x + e) 
for all x G Q and all < e € Q (see Figure 10.14). Similarly as in the definition of 
mi, such operations a, b can be easily constructed. It is easy to see that the operation 
mx neither preserves the relation / nor the relation U introduced in Section [l0.5.3| 

Lemma 10.5.20. Let R be a temporal relation that is preserved by an operation f 
providing min-xor closure. Then R is min-xor closed. 

Proof. Let ti and t 2 be tuples in R, and suppose that the symmetric difference 
M(ti) AM(t 2 ) of M(ti) and M(t 2 ) is non-empty. Let v\ and v 2 be the minimal values 
of the entries of t\ and of t2, respectively. Then there are eti, a 2 G Aut((Q; <)) such 
that a±v\ — and a 2 v 2 = and such that ai and a 2 map all other entries of t\ and t 2 
to integers. Consider the tuple £3 = f{a\t\,a 2 t 2 ). Because ct\t\ is for all entries at 
M(ti), a 2 t 2 is for all entries at M{t 2 ), and /(0,0) > /(0,a;) = f{y,0) for all x, y > 0, 
it follows that in £3 all entries at M(t\) D M(t 2 ) have a strictly larger value than all 
entries at M(t\) A M(t 2 ), which all have the same value. Because / preserves <, all 
entries of £3 at M(ti)P\M(t 2 ) have a smaller value than all entries not at M(ti)\JM(t 2 ). 
We conclude that the tuple t 3 € R satisfies M(t 3 ) = M{t\) A M{t 2 ). □ 

The following lemma implies that {/, pp} generates mx for any operation / that 
provides min-xor closure. 

Proposition 10.5.21. A temporal relation R is preserved by pp and an operation 
f providing min-xor closure if and only if R is preserved by mx. 

Proof. Clearly, mx provides min-xor closure. Lemma |10.5.4| shows that mx 
generates pp. 

For the opposite direction, suppose that R is fc-ary and preserved by pp and an 
operation / providing min-xor closure. We show that for any two tuples t\,t 2 G R 
the tuple £3 = mx(t\, t 2 ) is in R as well. Let a, b be the mappings as in the definition 
of the operation mx. Let v\ < ■ ■ ■ < vi be minimal set of rational numbers such 
that tz[i] € {Jj e [i]{ a {vj),b(vj)} for all i e [k], and let Mi be the set of indices {i' G 
[fc]|i 3 [i'] G {a(vi),b(vi)}}. Observe that for each i' G M 4 at least one of ti[i'} and t 2 [i'] 
is equal to Vi and the other value is greater or equal to Vi . Let M° be the set of those 
i! G Ah where tx[i'] ^ t 2 [i'} and M\ the set of those i' G Mi where v t = t-^i'] = t 2 [i'}. 

Let ct\,...,ai G Aut((Q; <)) be such that ai maps Vi to and such that the 
entries of ont\ and ctit 2 are integers. For each i G [I] we define Si := f(ctiti,ait 2 ). 
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It is easy to see from the choice of a* and properties of / that for each i £ [I] the 
tuple Si is constant at M^,M^, and that the value at M° is lower than the value at 
M\. Furthermore, because / preserves <, because the values of t\ at Uj_i+i Mj are 
greater than Uj, and because the values of t 2 at Uj=»+i are greater than Vi, we 
see that for each j £ > i and each i' £ M i: j' £ Mj, it holds that Si[i'] < Si[j'}. 



Having this, we can apply Lemma 10.5.3 and obtain a tuple from R with the same 



ordering of entries as in t^, which proves the lemma. □ 

Example 10.5.22. An interesting example of a temporal relation that is preserved 
by mx is the ternary relation X defined as follows. 

X(x,y,z) = (x = y A y < z) 

V (x = z A z < y) 

V [y — z A y < x) 

The relation is not preserved by min and by mi: the tuples t\ = (0,0,1), t<i = 
(0,1,0) are in X, but min{ti,t 2 ) = (0,0,0) R, and mi(tx,t 2 ) has three distinct 
entries and hence is not in X as well. □ 

An algorithm that solves constraint languages preserved by mx can be found in 
Section MJM 

10.5.5. Operations generating min, mi, mx. As we have seen in Proposi- 
tion 10.2.7 if the relation T3 has a primitive positive definition in 03, then CSP(03) is 
NP-hard. We show that if a temporal constraint language is shuffle-closed and does 
not admit a primitive positive definition of T3, then it is preserved by min, mi, or mx. 
If the relation T3 does not have a primitive positive definition in 03, then Theo- 
implies that 03 has a polymorphism that does not preserve T3. By Theo- 



5.2.3 



rem |10.3T2"} it suffices to consider operations that preserve <. We start with a sequence 
of auxiliary lemmas. 

Lemma 10.5.23. Let f be a binary operation preserving <, and suppose that 
there is an infinite sequence X\ < X2 < ■ ■ ■ of elements of Q and y\ £ Q such 
that f(x\,yi) > f(x2,yx) < f[xi,y\) for all i > 2. Then f generates an operation 
providing min-inter section closure. 

PROOF. Because / preserves <, we have that for any infinite sequence y\ < 
y 2 < ■ ■ ■ it holds that /(:E2,2/j) > f{x\,yi). Hence, the binary operation defined by 
f(a(x),(3(y)) provides min-intersection closure, where a £ Aut((Q; <)) maps 0, 1, . . . 
to x 2 ,x 3 , . . . and j3 £ Aut((Q; <)) maps 0, 1, 2, . . . to y, yi,y 2 , ■ • • □ 

Lemma 10.5.24. Suppose f preserves < and generates a sequence of operations 
fi, fiT-- such that for each ff. it holds that fk(0, 0) < fk(x, 0) and /jt(0, 0) < /fc(0, x) 
for all integers x £ [k]. Then f generates an operation g providing min-intersection 
closure. 

Proof. A direct consequence of Lemma [8.3. 13| □ 

Lemma 10.5.25. Let f be a binary operation preserving < such that there is an 
infinite sequence x\ < x 2 < ... and yi £ Q satisfying f(xi,y) > f{xj,y\) for all 
1 < i < j . Then {f,pp} generates an operation providing min-intersection closure. 

Proof. By Lemma [10.5. 24[ it suffices to show that there is a sequence of op- 
erations /i,/ 2 , . . . , generated by {f,pp} such that f k (0, 0) < fk{x,0) and f k (0, 0) < 
fk(0,x) for all k > 1 and all x £ [k]. 
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So let k > be a fixed integer, and yi < y 2 < ■ . . be an arbitrary infinite sequence. 
Let ak be from Aut((Q; <))such that 

a{f(xi,yi) | 1 < i < k} U 2/1) | 1 < i < k) C {z 2 , ■ • ■ , 2:2/0} 

and fa, fa € Aut((Q; <)) such that /?i maps 0, 1, 2, . . . to x\,X2, £3, ■ ■ ■ and /?2 maps 
0,1,2,... to 2/1,2/2,2/3, ■ ■ ■ We define 

fk(x,y) := f(a k f(fax,fay),fay) , 

and show that has the required properties. It follows from the assumptions on 
/ that for all positive integers x we have f(fa0, /3 2 0) = f(xi,y) > f{fax,yi) = 
f(fax,fa0), and due to the properties of a k it holds that /k(0, 0) < fk(x,0) for all 
integers x € [&]. 

We also have for every x € [fc] that /32a; > 2/1 and a k f{faQ, fax) > x\. Because / 
preserves <, this shows that fk(0,x) = f{a k f{fa0, fax)), fax) > f{xi,y\). Moreover, 
/ fc (0,0) = f(a k f(xi,yi),yi) < f{xi,y\) by the assumptions on /. Hence, fk(0,x) > 
f(xi,yi)> f k (0,0)iova]ixe[k\. □ 

The following lemma applies (a special case of) Ramsey's theorem; more substan- 
tial applications of Ramsey theory can be found in Section |10.6| 

Lemma 10.5.26. Let f be a binary operation preserving < such that there is an 
infinite sequence x\ < x 2 < ■ ■ ■ and 2/1 € Q satisfying f(x\, 2/1) > f(xi, 2/1) = f(xj,yi) 
for all 1 < i < j . Then {f,pp} generates an operation providing rain-intersection or 
min-xor closure. 

Proof. By the infinite pigeon-hole principle there must be an infinite sequence 
2/2 < 2/3 < ■ ■ • °f elements of Q larger than 2/1 such that 

(1) f{x 2 ,yi) = f(xi,yi) for all i > 2, or 

(2) f{x2,Vi) > f(xi,yi) for all i > 2, or 

(3) f{x 2 ,yi) < f(xi,yi) for all i > 2. 

In case 1, / generates an operation providing min-xor closure and we are done. In 



case 2, we apply Ramseys theorem (Theorem 8.1.1) in the special case of m = 2, 
r = 3 as follows. Let D be {2/1, 2/2, • • • }■ For i < j, define xiiViiVj}) = 1 if 
f{xi,Vi) = /Oi,2/j), x({Vi,Vj}) = 2 if f(xi,Vi) > f(xi,yj), and xiiVuVj}) = 3 



if f{xi,yi) < f{x\,yj). Then Theorem 8.1.1 applied to \ shows that there exists an 
infinite subsequence z\ < z 2 < . . . of 2/1 < 2/2 < ■ ■ ■ such that 

2a. f(xi, Zi) = f(xi, Zj) for all 1 < i < j, or 

2b. f(xi, Zi) > f(xi,Zj) for all 1 < i < j, or 

2c. f{xi,Zi) < f(xi,Zj) for all 1 < i < j. 
In case 2a, we swap arguments of / and proceed as in case 3. In case 2b, we swap 
arguments of /, apply Lemma |10.5.25| and conclude that / generates an operation 
providing min-intersection closure. In case 2c, note that f(xi,y{) > f(x2,yi) > 



f(xi,yi) for all i > 2, and thus we can apply Lemma 10.5.23 to conclude that / 
generates an operation providing min-intersection closure. 

In case 3, we show that similarly as in Lemma |10.5.25| there is a sequence of 
operations fi, / 2 , . . . generated by {/, pp} such that for each fk it holds that /fc(0, 0) < 
fk(x, 0) and fk(0, 0) < /fe(0, x) for all integers x 6 [k], and conclude by application of 
Lemma [l0.5.24| See Figure [T0.15| for an illustration. 

Let a k be from Aut((Q; <))such that akf(x 2 ,yi) — x\ and a{f(x\,yi) \ 1 < i < 
k} C {x 2 , ■ ■ ■ , x k +i}- Furthermore let fa, fa € Aut((Q; <)) be such that fa maps 
0, 1, 2, . . . to xi,X2,Xs, . . . and fa maps 0, 1, 2, . . . to 2/1,2/2, J/3 , • ■ ■ We define 

fk(x,y) := f(a k f(fax,fay),fay) . 
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Figure 10.15. Illustration for Case 3 of Lemma 110.5.261 



Then for all integers x > 

A- (0,0) = f{a k f(x 1 ,y 1 ),y 1 ) = f(x 2 ,y 1 ) , and 

fk(x,0) = f(a k f(l3ix,yi),yi) = /(a^yi) . 

Hence fk(0, 0) < fk(x,0) for all integers x > 0. Finally, since fi 2 x > yi and 
&kf{xi, /3 2 x) > xi for all integers x > 0, we have that fk(0,x) > f(xx,yx) > 

/(z2,yi) = / fe (o,o). □ 

The previous two lemmas are combined in the following result. 

Lemma 10.5.27. Let f be a binary operation that preserves < and violates the 
relation <. Then {f,pp} generates an operation providing min-intersection or min- 
xor closure. 

PROOF. As / violates <, we can without loss of generality assume that there is 
y e Q and Xi,x 2 <E Q, x x < x 2 , such that f{xi,y) > f(x 2 ,y). 
We claim that there are only three possibilities: 

a) There is an infinite sequence X3 < x^ < . . . such that x 2 < x% and f(xi, y) > 
,f(x 2 , y) for all i > 2. 

b) There is an infinite sequence x^ < x^ < . . . such that x 2 < X3 and f(xi, y) > 
f(xj,y) for all 2 < i < j. 

c) There is an infinite sequence X3 < X4 < . . . such that x 2 < X3 and f(xi, y) = 
f{x 2 , y) for all i > 2. 

To show this claim, observe that by the infinite pigeon-hole principle there is an 
infinite sequence x$ < X4 < . . . with x 2 < x 3 such that f(xi,y) > f{x 2 ,y) for all 
i > 2, f{x' i ,y) = f(x 2 ,y) for all i > 2, or f(xi 7 y) < f(x 2 ,y) for all i > 2. In the first 
and the second case the claim holds. In the third case, we repeat the argument with 
x 2 < X3 instead of x\ < x 2 . Again, we distinguish three cases, and as before in two of 
them we are immediately done. In the third case, we repeat again. If we repeat this 
for infinitely many times we obtain a sequence X3 = x' 3 < x\ < . . . such that x 2 < X3 
and f{x[,y) > f(x' j: y) for all 2 < i < j. 



In a) the conditions of Lemma 10.5.23 are satisfied and we conclude that {/, pp} 



generates an operation providing min-intersection closure. In b) Lemma 10.5.25|shows 



that {/, pp} generates an operation providing min-intersection closure. In c) we ap- 



ply Lemma 10.5.26 and conclude that {/, pp} generates an operation providing min- 



intersection or min-xor closure. □ 

The following is the main result of this subsection. Recall that the relation T3 
was defined in Definition 110.2.61 to be 

{(a, y, z) € Q 3 I (x = y < z) V (x = z < y) } . 
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Lemma 10.5.28. Let f be a binary operation that preserves < and violates the 
relation T3. Then {f,pp} generates min, mi, or mx. 



PROOF. By Proposition 10.5.9 10.5.13 and 10.5.21 it suffices to show that 



{/, pp} generates an operation providing min-intersection, min-union, or min-xor clo- 
sure. If / violates <, then we are immediately done by Lemma [i0.5.27| So we further 
assume that / preserves <. 

Because / preserves < and violates T 3 , we can assume without loss of generality 
(possibly after swapping arguments) that there are x\, X2, yx, y2 S Q such that x\ < 
X2, Vi < 2/2 and t := (f{xi,yi),f(x 2 ,yi),f(xi,y 2 )) & S. Because / preserves < we 
have that f(xi,y\) < f(x 2 ,yi) and f{x\,y\) < f(x\,y2). Since t £ 5, there are only 
two possibilities: 

(1) t[l] < t[2] and t[l] < t[3]. In this case, choose infinite sequences X3 < X4 < 
. . . and 2/3 < 2/4 < . . . such that X2 < X3, y 2 < J/3- Because / preserves <, we 
have for all i > 1 that f{x 2 ,yi) < f(xi,yi) and f(xi,y 2 ) < f{x\,yi). Since 

= f( x i,Vi) < t[2] = f(x 2 ,Vi) we have that f{x 1 ,y 1 ) < f{Xi,yi) for all 
i > 1, and since t[l] = f(xx,y%) < t[3] = f{x\,y2) we have that f(x\,yi) < 
f(xi,yi) for all i > 1. Hence, / provides min-intersection closure. 

(2) t[l] = t[2] = <[3]. In this case we can choose infinite sequences x' 2 < x' 3 < . . . 
and y' 2 < y' 3 < . . . such that x\ < x' 2 , y± < y' 2 , and for all i > 1, x^ < x 2 and 
Vi < V2- As / preserves <, we see that = f{x 1 ,y 1 ) = /(x^y't) for 
all z > 1 and thus / provides min-union closure. 

□ 

10.5.6. Algorithms for shuffle-closed languages. In this section we present 
three algorithms, for the languages preserved by mi, by mm, and by mi, respectively. 
All three algorithms follow a common strategy. They are searching for a subset of 
the variables that can have the minimal value in a solution. If they have found such 
a a subset, S, the algorithms add equalities and inequalities that are implied by all 
constraints under the assumption that the variables in S denote the minimal value 
in all solutions. Next, the algorithms recursively solve the instance consisting of the 
projections of all constraints to the variables that do not denote the minimal value 
in all solutions. We later show that for languages preserved by pp it is true that 
if the instance has a solution, it also has a solution that satisfies all the additional 
constraints. 

Throughout this section we assume that 05 is a structure with a first-order defini- 
tion in (Q; <) and a finite relational signature. For the formulation of the algorithms 
and their correctness proofs it will be convenient to work with an expanded con- 
straint language, that contains the binary relation = for the equality relation. We 
also add to the temporal constraint language 05 several other temporal relations that 
are primitive positive definable in ( S. 

Definition 10.5.29. Let R be an n-ary temporal relation and L = {pi, . . . ,pk} C 
[n] where Pi < • ■ • < Pk- Let {qi, . . . , q{\ be [n] \ L. Then the ordered projection of R 
to L is the k-ary relation R' with the primitive positive definition 

R {^pi 7 ■ ■ ■ 7 %Pk ) = J • ' ■ 7 ^ qi - R( x 1 7 • • • ) *^n) A Xi <C Xj . 

i£[n]\L, j£L 

Note that if 25 is a finite temporal constraint language, then there are only finitely 
many projections and ordered projections of relations in 58. In case that there is a 
primitive positive definition of < in 23, ordered projections are primitive positive 
definable. By Lemma [l.2.6| we can assume in this case that 03 contains all relations 
that can be defined by ordered projections from relations in 23. 
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To formally introduce our algorithms, we also need the concept of an ordered 
projection of instances of the CSP. 

Definition 10.5.30. Let 23 be a temporal constraint language that contains all or- 
dered projections of relations from < B . Let<& be an instance o/CSP(23) and X C V(&). 
Then the ordered projection of <& to X is the instance of CSP(23) that contains for 
each constraint R(xi, . . . , x n ) in <!>, with not necessarily distinct variables x\, . . . , x n , 
the constraint i?'(xfe x , . . . , Xk t ) where fci < ■ ■ ■ < fc; are such that {ki,...,ki} = {k G 
[n] | x k G X}, and R' is the ordered projection of R to {ki, . . . , k{\. 

Let $ be an instance of a temporal CSP. 

Definition 10.5.31. If ip = R(xi, . . . ,Xk) is a constraint from then a subset 
X of the variables of ip is called a min-set (of ip) if there exists a k-tuple t satisfying 
ip such that x € X iff the value for x in t is the minimum of all entries of t. A 
set of variables S C V(&) is called free iff it is non-empty and for all constraints 
R(x\, . . . ,Xk) in $ the set S fl {xi, . . . ,Xk} is either empty or a min-set of R. 

We will show how to use the concept of freeness to solve instances of CSP(23) for 
shuffle closed temporal constraint languages. 

Lemma 10.5.32. Let $ be an instance o/CSP(23) for some shuffle closed 23, and 
let S be a free set of variables o/3>. Then $ has a solution if and only if the ordered 
projection $' of $ to V(&) \ S has a solution. 

PROOF. First suppose $' has a solution s'. Let ip = R(x\, . . . , x m ) be a constraint 
of $ such that V(ip) n S = {x pi , . . . ,x Pk } ^ 0. Let {x qi , . . . ,x qi } = V(tp) \ S for 
qi < ■ ■ ■ < qi- By the definition of an ordered projection, there is a tuple t\ e R 
such that s'(xi) = ti[i] for all i e {qi, . . . ,qi}. Since V(ip) n 5* is a min-set of R, 
there is a tuple t 2 G R such that M(t 2 ) = {pi, ■ ■ ■ ,Pk}- Let a <G Aut((Q; <)) be such 
that a maps the minimal value of t 2 to 0. Because R is preserved by pp, the tuple 
t 3 := pp{a(t 2 ),ti) is in R. It is easy to verify that M(t 3 ) = {pi, . . . ,p^} and that 
there is (3 e Aut((Q; <)) such that fit^i] = s'(xi) for i e {qi, . . . , q{\. Because we can 
find such a tuple for all the constraints ip in $ where V{ip) n S ^ 0, we conclude that 
a solution s' of $' can be extended to a solution s of $ by setting all the variables 
in S to some value that is smaller than the smallest value in {s'(x) \ x e V($>')}. 
Clearly, all the constraints ip in $ with V(ip) n S = or V(ip) C S are satisfied by s 
as well. 

Now suppose that $ has a solution s. Let xi,...,x n be the variables of $, 
and let {x ri , . . . ,x r[s[ } be S. Let s' be a mapping from V(&) to Q such that 
M((s'(xi),...,s'(x n ))) = {ri,...,r ]sl }, and s'(x) = s(x) for x G V(&)\S. We 
claim that s' is a solution for 3>'. Let ip — R(yi, ■ ■ ■ , y m ) be a constraint of <& such 
that V(ip) n S 7^ 0. Clearly, t\ := (s(yi), . . . , s{y m )) is in R since s is a solution 
of $. Let {y Pl , . . . , y Pl } be Sfl {j/i , . . . , y m }. Since {y Pl , . . . , y Pl } is a min-set of R, 
there is a tuple t 2 G R such that M(t 2 ) = {pi, . . . ,pi}. Let a G Aut((Q; <)) be such 
that a maps the minimal value of t 2 to 0. Because R is preserved by pp, the tuple 
t 3 := pp{at 2 , ti) is in i?. It is easy to verify that M(t 3 ) — {pi, . . . ,p{\, and that there 
is an automorphism (3 such that (3t^[i] — s(yi) for i G \m] \ {pi, . . . ,pi}. Clearly, the 
restriction of s' to V(&) \ S is a solution to the ordered projection of <I> to V($) \ S 
since s' also satisfies all the inequalities imposed by the ordered projection. Therefore 
$' is satisfied by s'. □ 

The above lemma asserts that if we are able to identify a free set for instances 
of CSP(*8) for a shuffle-closed temporal language 23 in polynomial time, then we 
also have a polynomial time algorithm that solves CSP (25). The running time of the 
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Solve($) 

// Input: An instance $ of CSP(SS) 

/ / for a shuffle closed temporal language 23 

/ / Output: A solution s to $, or reject if there is no solution. 

i := 

while V($) / do begin 
S := FindFreeSet($) 
if S = false then reject 
for each x £ S do s(a;) := £ 
£ := £ + 1 

$ := ordered projection of$tol / ($)\S' 

end 

return s 



Figure 10.16. An algorithm that efficiently solves instances of a 
shuffle closed constraint language if free sets can be computed effi- 
ciently. 



algorithm is 0(n ■ (m + t(n,m))), where n — \V\, m is the number of constraints in 
$, and t(n,m) is the running time of the procedure that computes the free set of an 
instance with n variables and m constraints. 

An algorithm for languages preserved by min. Now, we concentrate on the 
problem to find a free set of $ if 23 is preserved by the operation min. 

Let tjj = R(xi, . . . , Xk) be a constraint where R is from *B and let L be a subset of 
{xi, . . . , Xk}- Let At, . . . , Ai be all min-sets of ip that are contained in L. When I > 1, 
i.e., when such min-sets exist, there is a unique set Aj, j € [I], with the property that 
Ai C Aj for all £ € [Z], because R is preserved by min, and thus min- union closed by 
Lemma |l0.5.8| We call this min-set the maximal min-set of ip contained in L. Note 
that for some L it could be that I = 0, i.e., L does not contain min-sets of R. 



Figure 10.17 shows our procedure for finding a free set for a min-union closed 
constraint language. It is straightforward to check that the procedure FindFreeSetUC 
has a running time 0{nm), where n is the number of variables and m is the number 
of constraints of $. 

Lemma 10.5.33. The procedure FindFreeSetUC in Figure {i0.1T\ returns a free set 
of or rejects. If it rejects, $ is unsatisfiable. 

Proof. Suppose that the algorithm returns a (non-empty) set S. Then recheck 
must be set to false. Therefore, for all constraints R{x\, ...,Xk) of $ such that S n 
{xi, . . . , Xk} 7^ the maximal min-set of ip contained in S equals S n {xt, ■ ■ • , 
We conclude that S is a free set of 

We now have to argue that in case that $ is satisfiable, the algorithm does not 
reject (i.e., it finds a free set). If $ has a solution, there is some set S' of variables that 
have the minimal value in this solution. At the beginning of the procedure, S is set 
to V and therefore S' C S. We show that S' C S during the entire execution of the 
procedure. Let ip = R{xi, . . . , Xk) be a constraint from <E>. Because S" n {x\, . . . , Xk} 
is a min-set of ip that is contained in S, the maximal min-set of ip added to S \ 
{xi, . . . , Xk} certainly contains S' H {x\, . . . , Xk}- Therefore, after the modification to 
S it still holds that S D S' . When the procedure terminates, it returns the set S, 
because ^ S' C S. □ 

Theorem 10.5.34. 7/55 is preserved by min there is an algorithm solving CSP(Q5) 
in time 0(n 2 m). 
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FindFreeSetUC($) 

// Input: An instance $ of CSP(Q3) with variables V 

II for a temporal constraint language 23 preserved by min. 

II Output: A free set 5* C V of $, or reject. 

//If the algorithm rejects, $ is unsatisfiable 

S := V 

recheck := true 
while recheck do begin 
recheck := false 
for all ip £ <& do begin 

if S n V(V0 / then begin 

S := (S\ V{ip)) U the maximal min-set of ip contained in 5* (~1 V(ip) 
if S changed then recheck := true 

end 

end 

end 

if S / then return S 

else reject 

end 



Figure 10.17. A polynomial time algorithm that computes free sets 
for constraint languages preserved by min. 

PROOF. We use the procedure FindFreeSetUC in Figure [T0.17| for the subroutine 
FindFreeSet in Figure |10.16| Then Lemma |10.5.32| and Lemma |10.5.33| imply the 
correctness of the resulting algorithm. □ 

An algorithm for languages preserved by mi. In this section we describe 
how to find free sets in instances of CSP(25) for languages 03 that are preserved by 
mi. We define the notion of a minimal min-set: Let ip = R(%i, ■ ■ • , Xk) be a constraint 
from an instance $ of CSP(Q3), and let L C {x\, . . . , Xk}- Let A\, , . . , A\ be all min- 
sets of "0 that contain L. Because R is preserved by mi, and thus is min-intersection 
closed by Lemma 10.5.12[ there is a min-set Aj of ip that is a subset of every min-set 



containing L. We call Aj the minimal min-set of R containing L. 

The procedure for finding a free set for min-intersection closed constraint lan- 
guages is given in Figure |10.18| It is straightforward to verify that the above algo- 
rithm runs in time 0(n 2 m) where n is the number of variables and m is the number 
of constraints in $. 



Lemma 10.5.35. The procedure FindFreeSetIC in Figure 10.18 returns a free set 
S o/$, or rejects. If it rejects, $ is unsatisfiable. 

Proof. Suppose that the algorithm returns a set S. The variable correct must 
then be equal to true. When the while loop terminates, recheck equals false, and so 
for all constraints ^ g $ such that V(ip) n S ^ the set S did not change. This 
implies that for all these constraints the minimal min-set of ip containing S (1 V{tp) is 
equal to S fl V(ip). We conclude that S is a free set of $. 

We now have to argue that in case that $ is satisfiable, the algorithm does not 
reject. If $ has a solution, then there is some set S" of variables that have the minimal 
value in this solution. Consider a run of the while loop in the procedure FindFreeIC 
for some variable x G S' . In the beginning, it holds that S = {x} C S' . For each 
constraint tp from $ we have that S'nV(ip) is a min-set of ip if S'P\V(il>) 1S non-empty. 
Therefore, the program variable correct cannot be set to false while S C 5". Because 
we always add only variables of the minimal min-set of ip containing S n V{ip) to 5", 
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FindFreeSetIC(<!>) 

/ / Input: An instance <!> of CSP(23) where 25 is preserved by mi 
II Output: A free set S C V(®) of $, or reject 
//If the algorithm rejects, 3> is unsatisfiable 
for all x G V($) do begin 
S:={x} 

recheck := true; correct := true 
while recheck A correct do begin 
recheck := false 

for all constraints ip of 5> such that (V(tl>) fl S) 7^ do begin 

if there is no min-set of ip containing S n V(tp) then correct := false 
else begin 

5 := S U the minimal min-set of if) containing S n V(V>) 
if 5 1 changed then recheck := true 

end 

end 

end 

if correct then return S 

end 
reject 



FIGURE 10.18. A polynomial time algorithm that computes free sets 
for mill-intersection and shuffle closed constraint languages. 



all these variables are always in 5". Therefore, S remains a subset of S' all the time, 
and the algorithm docs not reject. □ 

Theorem 10.5.36. is preserved by mi there is an algorithm solving CSP(Q5) 
in time 0(n 3 m) . 



Proof. We use the procedure FindFreeSetIC in Figure 10.18 for the sub-routine 



FindFreeSet in Figure 10.16 Lemma 10.5.32 and Lemma 10.5.35 imply the correct- 
ness of these algorithms. □ 

An algorithm for languages preserved by mx. Finally, we consider languages 
*B preserved by mx. Let R be a relation from 03. For a tuple t € R, we define Xmin(t) 
to be a vector from {0, l} k such that Xmin(t)[i] = 1 if and only if t[i] is minimal in 
t. We define Xmin(R) to be {xmin(t) | t € R}. Since R is preserved by mx and 
hence min-xor closed by Lemma 10.5.20 the set Xmin(R) is closed under addition 



of distinct vectors over GF(2), and hence in particular clos ed un der the Boolean 
minority operation minority(x, y,z) — x © y © z. By Theorem 5.4.3 Xmin(R) U {0 fe } 
is exactly the set of solutions of a system of linear equations. 



Theorem 10.5.37. I/*B is preserved by mx there is an algorithm solving CSP(5$) 
in time 0(n 4 ). 

Proof. To find a free set of variables of an instance $ of CSP(Q3) (if it exists), we 
first construct a system S of linear equations over GF{2) with variable set {x v \ v € V} 
and linear equations as described above for each constraint in $. It is well-known that 
a solution of S that is distinct from k can be computed in cubic time (by Gaussian 
elimination). If there is such a solution, then the set of variables mapped to 1 is a 
free set of $. If the system has no such solution, then there is no free set of variables, 
and there is no solution for <F Now the claim follows from Lemma 110.5.321 as in 
Theorem |H)JL34| and Theorem MJM □ 
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10.6. Classification 

This section combines the previous results to show that every temporal constraint 
language has a polynomial-time constraint satisfaction problem, or is NP-complete. 

Lemma 10.6.1. Let f be a binary operation violating Betw and preserving <. Then 
there are t±,t 2 G Betw such that f(ti,t 2 ) has three distinct entries and f(ti,t 2 ) ^ 
Betw. 

PROOF. Since / violates Betw, there are two triples ti,t 2 G Betw such that 
t := f(ti,t 2 ) g" Betw. Because / preserves <, we can assume without loss of generality 
that ti [1] < <i[2] < ti[d] and t 2 [l] > t 2 [2] > t 2 [3}. If t has three distinct entries (in 
this case, we also say that t is injective), we are done. Otherwise we distinguish two 
cases: 

(1) t[l] = t[2] = t[3]: In that case, take a triple s\ such that si[l] < ti [1], 
si[2] — ti[2], and si[3] = ii[3]. We also choose a triple s 2 such that t 2 [2] < 
s 2 [l] < t 2 [l], s 2 [2] — t 2 [2], and s 2 [3] = t 2 [3]. It is straightforward to check 
that si[l] < si[2] < si[3] and S2[l] > s 2 [2] > s 2 [3] and thus both triples 
belong to Betw. Now, consider s := f(si,s 2 ). We have that s[2] = t[2], 
s[3] = i[3], and s[l] < t[i\ — s[2] = s[3] because / preserves <. Therefore 
s G" Betw. Take s\ instead of t\, s 2 instead of t 2 and proceed with case 2. 

(2) If exactly two entries in t have the same value, let i, j be their indices and let 
k be the index of the entry with the unique value. We assume that t[k] > t[i] 
(the other case is symmetric). It is straightforward to verify that there is 
an entry in t such that making the value of this entry smaller would make t 
injective and it would still not be in Betw. We can assume without loss of 
generality that i is an index of such an entry. We choose si so that si[i] < 
£i [z] , si[j] — ti[j], si[k] — ti[k], and si[l] < si[2] < Si[3]. We choose s 2 such 
that s 2 [i] < t 2 [i], s 2 [j] = t 2 [j), s 2 [k] = t 2 [k], and s 2 [l] > s 2 [2] > s 2 [3}. Note 
that si,s 2 G Betw. The tuple s := f{si,s 2 ) satisfies s[i] < t[i], s[j] = t[j], 
and s[k] = t[k]. By the choice of i we conclude that s is injective, s $ Betw 
and we are done. 

□ 



We use Ramsey theory via Theorem 8.3.17 to prove the following 



Lemma 10.6.2. Let f be a binary operation that preserves < and violates Betw. 
Then f generates 11, dual-ll, pp, or dual-pp. 



Proof. If / violates Betw and preserves <, then Lemma 10.6.1| asserts that there 



areii,i 2 € Betw such that t := f(ti, t 2 ) £ Betw and t is injective. As / preserves <, we 
can assume without loss of generality that £i[l] < ii[2] < ti[3] and t 2 [l] > t 2 [2] > t 2 [3] 
(otherwise, we apply the argument to f{y,x)). 

Either the triple t satisfies t[l] > t[2] < t[3] or t[l] < t[2] > t[3]. In the first case, 
let Si := {x e Q \ h[l] <x< h[2]}, S 2 :={xeQ \ h[3] < x}, T x := {y e Q | * 2 [3] < 
y < t 2 [2]}, and T 2 := {y e Q | t 2 [l] < y}. In the second case, let Si := {x G 
Q | h[2\ <x< ii[3]}, S 2 := {x e Q \ x < Ml]}, 2\ := {y G Q | t 2 [2] < y < i 2 [l]}, 



and T 2 := {y G Q | y < £ 2 [3]}. See Figure 10.19 for an illustration of these sets 



For each k G N, we define sets S[ k \ T[ k \ S 2 k \T 2 k ^ as follows. Apply Lemma 



8.2.8 



to the grid Si x T± (both Si and T x are infinite and in particular larger than R(R(fe))), 
and obtain subsets C Si and C T x such that \U {k \ > R(kY_[V^\ > R(fc), 



and / is canonical on x V^ k '. Similarly, we apply Theorem 8.2.8 to the grid 
[/W x T 2 and obtain subsets S [k) C C/( fe ) and T 2 (fe) C T 2 of cardinalit y at least k 

(k) (k) I I 

such that / is homogenous on SI ' x Tf'. We finally apply Theorem |8.2.8| to the 
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h[2] h[S\ *i[l] h[2] h[3] 



FIGURE 10.19. Grids chosen for the application of the product Ram- 
sey theorem. The depicted ordering on the values of / follows from 
the choice of t\ , t 2 and because / preserves < . 



(k) (k) 

k such that / is canonical on S 2 X T\ . Note that / is in particular canonical on 

g(k) x T (k) 



grid S 2 x V {k) and obtain subsets S { 2 k) C S 2 and T {k) C V {k) of cardinality at least 

10 

There are just 6 3 possibilities for how / behaves on those grids for given k. Hence, 

(k) (k) 

there is an infinite set K C N such that / behaves in the same way on S{ x If 
for all k £ K, in the same way on Si x T 2 for all k G K, and in the same way on 

g(k) x T (k) for an k e R 

The following observations will be obvious by inspection of Figure 10.19| left 
side. In case that s[ k ^ is before S 2 (that is, all elements in s[ are smaller than all 
elements in 5 2 fc) ) and I\ (fc) is before T 2 (fc) , then by the choice of S[ k) , 5 2 fe) , T[ k \ and 
T 2 , and because / preserves <, we have 

f(x,y) < /(ti[2],t2[2]) < f(h[l]Ml]) < fW'V') 

for all (x,y) G Sf 5 X T\ (fe) and (ar',y') € (S[ k) x T 2 (fe) ). Similarly, 

< }{h[2],t 2 m) < /(ti[3],t2[3]) < f(x",y") 

for all (x,y) G s| fe) x T 1 (fe) and € (sf x T x (fe) ). The other case i s that 

for an 

(fc) 



10.19 

w 



S 2 k ^ is before S[ k ^ and is before (see the right side of Figure 
illustration). In this case f(x, y) > f{h[2],t 2 [2}) > f(x', y') for all (a;, y) G S\ K> x T[ 
and (x',y') G {s[ k) x if*) U (S< fc) x if*). 

First suppose that / is dominated by the same argument on all the grids S[ x 
T\ (fe) , S {k) x T 2 (,c) , and 5^ fc) x I\ (fc) for all fc G K. We can assume that / is dominated 
on these grids by the second argument; otherwise we swap the arguments of /. Let 

(k) (k) 

g,h £ {leXy jXl lex y ^ x ,py\ be such that / behaves like g on SJ x Tj and like h on 
S 2 k ^ x . Then by the above observations and local interpolation / generates [g\h] 
if Si is before S 2 , and [/i|<?] if S2 is before Si. Moreover, we show that / also generates 
lex. 

• If g or h is lex xy or leXy yX , then / clearly generates Z&r. 

• If g or /i is lex Xt _ y or leXy _ x , then / generates /ex as well, as lex(x, —lex(x, —y)) 
behaves like lex(x,y). 



If 5 is p y and /i is then / generates lex by Lemma 10.4.4 
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Note that the operation [g\h] satisfies the conditions in Lemma 10.4.3 and hence 



{lex, [g\h]} generates [lex ytX \lex VtX }. By Lemma 10.4.4 / generates 11. 



Now we consider the case that / is dominated by different arguments on the grids 
s[ k) x T[ k) and s[ k) x T 2 (fc) , or by different arguments on the grids s[ k) x T 1 (fc) and S^ k) x 

(k) 

, for all k <G K. We only consider the first case; the second case is symmetric under 
swapping the arguments of /. Let g,h be from {lex x ^ y , lex x - y , lex y _ x , lex y ^ x ,p x ,p y ] 
such that / behaves like h on the grids S[ k ' x and like g on the grids x T[ k \ 
Again, by local interpolation / generates [h\g] if Si is before 52, and [g\h] if S2 is 
before Si- We assume without loss of generality that / generates [h\g] (in the other 
case we can exchange the names of h and g and proceed in the same way). 

If h is p y and g is p x , then [h\g] behaves like pp; hence / generates pp and we are 
done. Dually if h is p x and g is p y , then / generates dual-pp. In all other cases, either 
h or g is from lex x<y , lex VtX , lex x> _ y , or leXy x , and thus / generates lex as we have 
already seen before. But then Lemma [l . 4 . 3| shows that / generates 11 or dual-ll. □ 



in P 




Figure 10.20. An illustration of the classification result for tempo- 
ral constraint languages. Double-circles mean that the corresponding 
operation has a dual generating a distinct clone which is not drawn 
in the figure. 



We summarize our findings in the following classification statement; also see Fig- 
ure fTu^DI 

Theorem 10.6.3. Let 23 be a temporal constraint language. Then one of the 
following applies. 

• 58 is preserved by at least one of the following nine operations: 11, min, mi, mx, 
their duals, or a constant operation. 

• There is a primitive positive definition of Betw, Cycl, Sep, T3, —T3, or Eq 
in 05. 



PROOF. Theorem 10.3.2 asserts that one of the following cases is true: 
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(1) There is a primitive positive definition of Cycl, Betw, or Sep in 03. 

(2) Pol(03) contains a constant operation. 

(3) Pol(03) contains all permutations of 



In this case, Theorem 6.3.3 shows 
that 03 either has a binary injective polymorphism g, or the relation Eq has 
a primitive positive definition in 03. In the first case, by composing g with 
a permutation, we see that all binary injective operations preserve 03, and 
hence in particular the relation 11 is a polymorphism of 03. 
(4) there is some binary / g Pol(03) that preserves <. 

We are done in all cases except the fourth; in this case Lemma 10.6.2 implies that 
the operation / generates pp, dual-pp, 11, or dual-ll. If / generates 11 or dual-ll there 
is nothing to show. If / generates pp then Lemma |10.5.28| shows that either T3 has 
a primitive positive definition in 03, or 03 is preserved by min, mi, or mi. Dually, if 
/ generates dual-pp then either — T3 has a primitive positive definition in 03, or 03 is 
preserved by one of the duals of min, mi, or mx, which completes the proof. □ 

With the previous theorem it is easy to obtain the full complexity classification 
for temporal constraint satisfcation problems, and finally show Theorem|10.1.1| 



Proof of Theorem 110. l.ll We first show that when 03 is preserved by 11, min, 
mi, mx, one of their duals, or the constant operation, then 03 is also preserved by 
an operation / satisfying f(x,y) — af {fly, fix) for a,fi G Aut((Q; <)). The opera- 
tion min is idempotent and commutative, so there is nothing to show in this case. 
The operation mx is commutative; and by Lemma |3.1.8| mx generates a function / 
satisfying f{x,y) = otf{y,x) for an automorphism a of (Q; <). 

When 03 is preserved by mi or by 11 we have shown in Proposition |10.5.16| and 
Proposition 10.5.16 that 03 is also preserved by a binary function / satisfying f(x, y) = 
af(fiy, fix) for some automorphisms a, fi of Q. 

Now let 03' be a finite signature reduct of 03. If 03' is preserved by a constant 
operation, then tractability of CSP(03') follows from Proposition 1.1.11 For the case 



that 03' is preserved by 11 or dual-ll we have presented a polynomial-time algorithm 
for CSP(03') in Theorem 10.4.18 If 03' is preserved by min, mi, mx or one of their 



duals, tractability of CSP(03') is shown in Section 10.5.6 



Now suppos e that 03 is not preserved by one of the listed operations. Then by 



Theorem 



10.6.3 



we know that one of the relations Betw, Cycl, Sep, T3, — T3, or Eq 



has a primitive positive definition in 03. Each of those relations together with finitely 
many constants primitively positively interprets ({0, 1}; 1IN3): 

• For (Q; Betw, 0) a primitive positive interpretation of ({0, 1}; NAE) has been 
shown in Proposition |5 . 5 . 14| which also gives a primitive positive interpreta- 
tion of ({0, 1}; 1IN3) in (Q; Betw, 0) via Theorem \5l>A8\ 

• a primitive positive interpretation of ({0, 1}; 1IN3) in (Q; Cycl) has been 
given in Theorem 10.2.8| 

• The structure (Q; Sep, 0, 1) primitively positively interprets ({0, 1}; 1IN3) by 
Proposition [10375] 

• The structure (Q;T3,0) primitively positively interprets ({0, 1}; 1IN3) by 
Proposition 10.2.7] the proof for — T is dual. 



E; 



5.5.10 



j 6 primitively positively interprets ({0, 1}; 1IN3) by Proposition 

Finally, by Proposition |5.6.10| it follows that the two cases in the statement of Theo- 
rem [HEO] are distinct. □ 



See Figure |10.21| for an overview over the nine largest tractable temporal con- 
straint languages; the entries also mention typical relations for the respective lan- 
guage, i.e., a set of relations that is contained in the language, but not contained in 
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any other of the nine languages - hence, these relations show that all the languages 
are distinct. 



Polymorphism 


Typical Relations 


Complexity 


Reference 


min 


{U,<} 


0(n 2 m) 


Theorem 


10.5.34 


mi 


{1} 


0(n 3 m) 


Theorem 


10.5.36 


mx 


{X} 


0(n 4 ) 


Theorem 


10.5.37 


max = dual min 


{-u,<} 


0(n 2 m) 






dual mi 


{-1} 


0(n 3 m) 






dual mx 




0{n A ) 






11 


{(u ^ v) V (x > y) V (x > z)} 


O(nm) 


Theorem 


10.4.18 


dual 11 


{(it ^ v) V (x < y) V (x < z)} 


O(nm) 






constant 


{(x < y < z)y (z < y < x)} 


0(m) 







Figure 10.21. Summary of the various tractable languages. For 
the last three operations, the typical relations are given by their 
first-order definition; in all other cases, see Section fl0.5| 



We want to remark that the so-called meta-problem for tractability is decidable; 
this is formally stated in the following corollary. 

Corollary 10.6.4. There is an algorithm that, given quantifier-free first-order 
formulas <f>\, ...,(/>„ that define over (Q; <) the relations R±, . . . , R n , decides whether 
CSP((Q; Ri, . . . , Rn)) is tractable or NP-complete. 



a 



CHAPTER 11 

Non-D ichot omies 
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There are basically two methods for proving that a subclass of NP does not have 
a complexity dichotomy. The first is to show that for every problem in NP there is a 
polynomial-time equivalent problem in the subclass. By polynomial-time equivalent 
we mean that there are polynomial-time Turing reductions between the two problems. 
The non-dichotomy result then follows from Ladner's theorem 
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which asserts 

that there are problems in NP that are neither in P nor NP-compIete, unless P=NP. 
This method has been applied to show that, for example, the class of monotone 
SNP does not exhibit a complexity dichotomy 89 . We will apply this technique in 



Section 11.1 and Section 11.2 to give two different proofs of the fact that the class of 
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all constraint satisfaction problems with infinite domains does not have a complexity 
dichotomy. 

The second technique to show a non-dichotomy is to directly use Ladner's proof 
technique, which is sometimes called delayed diagonalization. We will use this method 



in Section 11.3 to show that there are w-categorical structures 23 such that CSP(23) 
is in coNP, but neither in P nor coNP-complctc (unless P=coNP). The question 
whether there are w-categorical structures 23 such that CSP(23) is in NP \ P but not 
NP-complete is still open. 



This chapter contains results from [31] (in Section 11.3) as well as previously 
unpublished results. 

11.1. Arithmetical Templates 

In this section we show that for every computational decision problem there exists 
a polynomial-time equivalent constraint satisfaction problem with an infinite template 



23. This result was first shown in 31 . Here we present a new proof that uses 
Matiyasevich's theorem. In fact, we prove a stronger result, namely the existence of 
a single structure £ such that for every recursively enumerable problem V there is a 
structure 23 with a first-order definition in C such that CSP(23) is polynomial-time 
equivalent to V . A second proof, based on the results in Section [i.4.2 of Chapter [lj 
can be found in the next section. 



Previously, Bauslaugh [15 showed that for every recursive function / there exists 
an infinite structure 23 such that CSP(23) is decidable, but has time complexity at 
least /. More recently, Schwandtner gave upper and lower bounds in the exponen- 



tial time hierarchy for some infinite domain CSPs 179 ; but these bounds leave an 
exponential gap. 

In this section we make essential use of the following theorem, which is due to 
Davis, Matiyasevich, Putnam, and Robinson. 



Theorem 11.1.1 (See e.g. 155 ). A subset ofl is recursively enumerable if and 
only if it has a primitive positive definition in (Z; *, +, 1), the integers with addition 
and multiplication. 

Consider the structure £ = (ZU {oo}; i?*, 1) where R+ is the ternary relation 

{(x, y,z) | x, y, z € Z, x + y — z} U {(oo, oo, oo)} 

and i?* is the ternary relation 

{(x, y, z) | x, y, z G Z, xy — z} U {(oo, oo, oo)} . 

Proposition 11.1.2. For every recursively enumerable language C there exists a 
relational structure 23 with a first-order (in fact, a primitive positive) definition in €. 
such that CSP{%) is polynomial-time Turing equivalent to C 

Proof. Code £ as a set L of natural numbers, viewing the binary encodings of 
natural numbers as bit strings, so that s 6 £ if and only if the number with binary 
encoding Is is in L, appending the symbol 1 at the front so that for instance 00 and 
01 correspond to different numbers in L. Now consider the structure 

23 := (ZU{oo};5,D,L') 

where 

• S is the binary relation defined by the formula R + (x, 1, y) in £, 

• D is the binary relation defined by R+(x,x,y) in £, and 

• L' is the relation L U {oo}. 
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We claim that also the unary relation V has a first-order definition in £. 

Note that because the set L is a decidable subset of Z, it has a primitive positive 



definition 8(x) over the structure (Z; +, *) by Theorem 11.1.1 Let V be the variable 



set of 8. Consider the graph G{5) with vertex set V where x and y are adjacent if 
they appear in the same atomic formula of 8. We can assume that G{8) is connected, 
because if it has a connected component C that does not contain x G V, then we can 
remove all conjuncts from 8 that contain variables from C (unless those conjuncts are 
unsatisfiable, in which case 8(x) defines the empty set over (Z; +, *) and the statement 
is trivial). Let 5' be the formula obtained from 8 by un-nesting atomic formulas in 8. 
That is, atomic formulas of the form t% + t% — ta, for terms ti, ^2,^3 over (Z; +, *) will 
be replaced by R+(x, y, z) A x = t\ A y — t-2 A z — t% where x, y, z are new existentially 
quantified variables. Atomic formulas of the form t± * t-2 — t 3 can be treated similarly, 
and by iterating this process we obtain a primitive positive l}-formula 8'; 

we claim that 8' defines L' over (£. 

Clearly, setting all variables of 8' to 00 satisfies all conjuncts of 8' , and hence 
8'(oo) holds over £ For every n <E L, there is an assignment s : V — > Z that shows 
that 6(n) holds over (Z;+,*). Then the same assignment shows that 8'(n) holds 
over £ For the converse, suppose that 8'(n) holds over £ Consider the assignment 
s : V ^ Zii {00} that shows that n satisfies 8' . Since G(S) is connected, G(S') is also 
connected, and it is easy to see that either s(y) = 00 for all y e V, or s(y) e Z for 
all y € V. In the latter case, 5(x) holds over (Z; +, *). Therefore, 8' defines L' over €. 
and 55 is primitive positive definable in £ 

We finally have to verify that CSP(03) is polynomial time equivalent to C. We 
first show that there is a polynomial-time reduction from C to CSP(03). View an 
instance of £ as a number n as above, and let rj(x) be a primitive positive defintion 
for x — n in 03. It is possible to find such a definition of size O(logn) by repeatedly 
doubling (y = x + x) and incrementing (y = x + 1) the value (this also follows from 
the more general Lemma 1.5.2). It is clear that n codes a yes-instance of C if and 
only if 3x(rj(x) A L'{x)) is true in 03. 

To reduce CSP(Q3) to £, we present a polynomial-time algorithm for CSP(Q3) that 
uses an oracle for C (so our reduction will be a polynomial-time Turing reduction). 
Let <j> be an instance of CSP(03), and let H be the undirected graph whose vertices 
are the variables W of 4>, and which has an edge between x and y if <j> contains the 
constraint S(x,y) or the constraint D(x,y). Compute the connected components of 
H. If <fi is true in 03 and contains a constraint D{x,y), then x = y = and every 
variable in the component of y in H is forced to a unique value, which can be found 
easily by traversing the component. If during this propagation one of the variables is 
assigned a non-integer value, the algorithm rejects. 

If H contains a cycle, then the corresponding constraints for a linear equation 
system. Such a system is either unsatisfiable, or it has infinitely many solutions, or a 
single solution, and which of those three cases applies can be decided in polynomial 
time by Gaussian elimination. If the system is unsatisfiable, then the algorithm 
rejects. If the system has infinitely many solutions, then it contains a constraint such 
that removing this constraint does not change the set of solutions. This constraint can 
also be found in polynomial time by Gaussian elimination, and we remove it. Finally, 
if the system has a unique solution, then this solution can be computed in polynomial 
time. If in this unique solution one of the variables is not integer, the algorithm 
rejects. Otherwise, we assign the corresponding integer values to the variables, and 
again propagate the values through the respective component of H. For each variable 
x that has already been assigned a number and which has a constraint L'(x), call 
an oracle for C. If any of those calls has a negative result, reject. Otherwise, if H 
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contains no cycles and no constraints of the form D(x,y), we can set all variables to 
oo and satisfy all the constraints. □ 

Let < be the order which extends the usual order on Z to Z U {00} by setting 
x < 00 for all x £ Z. Then note that the structure 03 from the previous proof has the 
operation (x,y) h-> max < (x,y) as a polymorphism. 

11.2. CSPs in SNP 

Another proof that shows that every problem in NP is polynomial-time Turing 
equivalent to an infinite domain CSP is based on a result by Feder and Vardi, and 
the results from Section fl. 4. 31 

Theorem 11.2.1 (Theorem 3 in [89] ) . Every problem in NP is equivalent to a 
problem in monotone SNP under polynomial-time reductions. 

We show the following. 

Proposition 11.2.2. Every problem in monotone SNP is equivalent to a problem 
in monotone connected SNP under polynomial-time Turing reductions. 

Proof. Let $ be a monotone SNP sentence of the form 3R%, . . . ,Rk Vxi, ... ,xi. (f> 
for cj) quantifier-free and in conjunctive normal form. The sentence \& that we are go- 
ing to construct from $ has an additional free relation symbol E, and an existentially 
quantified relation symbol T, and is defined by 

3Ri, . . .,R k ,TVxi, ...,xi.tp 

where ip is the quantifier-free first-order formula with the following clauses. 

(1) -,E(x 1 ,x 2 )VT(x 1 ,x 2 ); 

(2) ->T(xi,X2) V -nT(x 2 ,x 3 ) V T(x 1 ,x 3 ); 

(3) ->T(xi,X2)VT{x2,xi); 

(4) for each clause 4>' of <f> with variables x\, . . . ,x q , the clause 

41 v V -ir{xuxj) . 

i<j<q 

The sentence \I/ is clearly connected and monotone. We are therefore left with the 
task to verify that and are equivalent under polynomial-time Turing reductions. 

We start with the reduction from <!> to 1 J r . When 21 is a finite r-structure, we 
expand 21 to a (r U {i?})-structure 21' by choosing for E the full binary relation. Then 
also T must denote the full binary relation (so that the clauses from item (1), (2), 
and (3) above are satisfied), and the clauses introduced in (4) are equivalent to <f>' . 
Hence, $ holds on 21 if and only if ^ holds on 21'. 

For the reduction from VP to <&, let 21 be an instance of <5 '. We can compute the 
connected components C\ , . . . , of the {i?}-reduct of 21 in polynomial time in the 
size 21. For each of those connected components C, we evaluate $ on the r-reduct 2tc 
of 2l[C]. If for one component this evaluation is negative, then 2l[C] and consequently 
21 do not satisfy ^. Otherwise, for each C there exists an r U {Ri, ... , i?fc}-expansion 
of 2lc that satisfies 4>. Let 21' be the expansion of the disjoint union of all those 
(r U {R±, . . • , Rk}) -structures by the relation T denotes the equivalence relation with 
equivalence classes Ci, . . . , Cfe. Clearly, all clauses from items (1), (2), and (3) in the 
definition of ^ are satisfied by 21'. Each g-tuplc (ai, . . . , a q ) from elements of 21' either 
contains entries from different components, and hence satisfies the disjunctions from 
item (4), or contains only entries from the same component C, but in this case the 
tuple also satisfies the disjunctions from item (4) since 2lc satisfies □ 
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Corollary 11.2.3. For every problem in NP there is a structure 03 such that $ 
is polynomial-time Turing equivalent to CSP(03). 

Proof. By Theorem |11.2.1[ every problem in NP is equivalent to a monotone 
SNP sentence $ under polynomial-time reductions. We have shown in Proposi- 
tion 11.2.2 that $ is equivalent to a monotone connected SNP sentence and by 



Theorem 1.4.10 there exists an infinite structure 03 such that ^ describes CSP(03). □ 



In Figure [11. 1 1 the diagram about the fragments of SNP from Section [L4| has been 
decorated with information about the complexity classification status. 




FIGURE 11.1. Dichotomies and non-dichotomies for fragments of 
SNP. CSP(inf) (CSP(w-cat), CSP(fin)) refers to the class of all prob- 
lems CSP(03) where 03 is an infinite (w-categorical, finite, respec- 
tively) structure with finite relational signature. 



11.3. coNP-intermediate cj-categorical Templates 

In this section we show that there exists an w-categorical directed graph 03 such 
that CSP(03) is in coNP, but neither coNP-complete nor in P (unless coNP^P). All 
structures in this section will be Fraisse limits of classes of directed graphs. 

Let Af be a class of finite tournaments, and recall that Forb(A/"), the class of 
all finite digraphs that does not embed a tournament from Af, is an amalgamation 



class (Example 3.2.6). We write 03 n for the Frai'sse-limit of Forb(A/"). Observe that 
for finite Af the problem CSP(03a^) can be solved in deterministic polynomial time, 
because for a given instance 21 of this problem an algorithm simply has to check 
whether there is a homomorphism from one of the structures in Af to 21, which is the 
case if and only if there is a homomorphism from 21 to 03^". 

When proving that there are uncountably many homogeneous digraphs, Henson 
specified an infinite set T of tournaments Tx,T%, ■ ■ ■ with the property that Ti does not 
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embed into Ta if i =/= j. The tournament T„ in Henson's set T has vertices 0, . . . , n + 1, 
and the following edges: 

• (i, j) for j = i + 1 and < i < n; 

• (0,n + l); 

• (j, i) for j > i + 1 and (i, j) f (0, n + 1). 
PROPOSITION 11.3.1. TTie -problem CSP(Q3r) is coNP-complete. 

Proof. The problem is contained in coNP, because we can efficiently test whether 
a sequence v±, . . . , of distinct vertices of a given directed graph 21 induces T k in 
21, i.e., whether (vi,Vj) is an arc in 21 if and only if is an arc in T^., for all 

i, j G {1, . . . , k}. If for all such sequences of vertices this test is negative, we can be 
sure that 21 is from Forb(T), and hence homomorphically maps to 25r- Otherwise, 21 
embeds a structure from T, and hence does not homomorphically map to 55 7-. 

The proof of coNP-hardness goes by reduction from the complement of the NP- 



complete 3SAT problem (see Example 1.2.2), and is inspired by a classical reduction 



from 3-SAT to Clique. For a given 3-SAT instance, we create an instance 21 of 
CSP(23 r ) as follows: If 

{x\, Xq, Xq}, . . . , {x k+1 , Xf, +1 , 

are the clauses of the 3-SAT formula (we assume without loss of generality that the 
3-SAT instance has at least three clauses) , then the vertex set of 21 is 

{(0, 1), (0, 2), (0, 3), . . . , (k + 1, 1), (k + 1, 2), (k + 1, 3)} , 

and the arc set of 21 consists of all pairs {p, q)) of vertices such that x\ ^ ^x^ 

and such that (i,p) is an arc in Tj~. 

We claim that a 3-SAT instance is unsatisfiable if and only if the created in- 
stance 21 homomorphically maps to 03 7-. The 3-SAT instance is satisfiable iff there 
is a mapping from the variables to true and false such that in each clause at least 
one literal, say x 3 Q ° , . . . , xP^tl , is true. This is the case if and only if the vertices 
(0,ji), ...,(k+ 1, j k +i) induce T k in 21, i.e., ((i, (p,j P )) is an edge if and only if 
(i,p) is an edge in T k . This is the case if and only if T\~ embeds into 21, and if and 
only if 21 does not homomorphically map to Q37-. □ 



We now modify the proof of Ladner's Theorem given in |164| (which is basically 
Ladner's original proof) to create a subset 7o of T such that CSP(Q377,) is in coNP, 
but neither in P nor coNP-complete (unless coNP=P). One of the ideas in Ladner's 
proof is to 'blow holes into SAT', such that the resulting problem is too sparse to be 
NP-complete and to dense to be in P. Our modification is that we do not blow holes 
into a computational problem itself, but that we 'blow holes into the obstruction set 
T o/CSP(Q3 r )'- 

In the following, we fix one of the standard encodings of graphs as strings over the 
alphabet {0, 1}. Let Mi, Mi, ... be an enumeration of all polynomial-time bounded 
Turing machines, and let i?i , R2, . . . be an enumeration of all polynomial time bounded 
reductions. We assume that these enumerations are effective; it is well-known that 
such enumerations exist. 

The definition of 7o uses a Turing machine F that computes a function / : N —¥ N, 
which is defined below. The set 7o is then defined as follows. 



% = {T n \ f(n) is even } 



11.3. CONP-INTERMEDIATE w-CATEGORICAL TEMPLATES 
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The input number n is given to the machine F in unary representation. The compu- 
tation of F proceeds in two phases. In the first phase, F simulates itself] on input 
1, then on input 2, 3, and so on, until the number of computation steps of F in this 
phase exceeds n (we can always maintain a counter during the simulation to recognize 
when to stop). Let k be the value f(i) for the last input i for which the simulation 
was completely performed by F. 

In the second phase, the machine stops if phase two takes more than n compu- 
tation steps, and F returns k. We distinguish whether k is even or odd. If k is even, 
all directed graphs 21 on s = 1,2, 3, . . . vertices are enumerated. For each directed 
graph 21 in the enumeration the machine F simulates M^m on the encoding of 21. 
Moreover, F computes whether 21 homomorphically maps to Q3t - This is the case 
if for all structures XJ £ T that embed into 21 the value of f(l) is even. So F tests 
for I = 1,2, ... ,s whether T; embeds to 21 (F uses any straightforward exponential 
time algorithm for this purpose), and if it does, simulates itself on input I to find out 
whether f(l) is even. If 

(1) M k / 2 rejects and 21 homomorphically maps to 05^ , or 

(2) M k / 2 accepts and 21 does not homomorphically map to 05^ , 
then F returns k + 1 (and f(n) = k + 1). 

The other case of the second phase is that k is odd. Again F enumerates all 
directed graphs 21 on s = 1, 2, 3, . . . vertices, and simulates the computation of Ryk/2\ 
on the encoding of 21. Then F computes whether the output of R\k/2\ encodes a 
directed graph 21' that homomorphically maps to 03 j- Q . The graph 21' homomorphi- 
cally maps to ^B-j-g iff for all tournaments Ti that embed into 21' the value f(l) is 
even. Whether TJ embeds into 21' is tested with a straightforward exponential-time 
algorithm. To test whether f(l) is even, F simulates itself on input I. Finally, F tests 
with a straightforward exponential-time algorithm whether 21 homomorphically maps 
to <B r . If 

(3) 21 homomorphically maps to 03 7- and 21' does not homomorphically map to 
®T , or 

(4) 21 does not homomorphically map to 93 7- and 21' homomorphically maps to 

then F returns k + 1. 

Lemma 11.3.2. The function f is a non- decreasing function, that is, for all n we 
have f(n) < f(n + 1). 

Proof. We inductively assume that f(s — 1) < f(s) for all s < n, and have to 
show that f(n) < f(n + 1). Since F has more time to simulate itself when we run it 
on n + 1 instead of n, the value i computed in the first phase of F cannot become 
smaller. By inductive assumption, k = f(i) cannot become smaller as well. In the 
second phase, we either return k or k + 1. Hence, if k becomes larger in the first 
phase, the output of F cannot become smaller. If k does not become larger, then the 
only difference between the second phase of F for input n + 1 compared to input n 
is that there is more time for the computations. Hence, if the machine F on input n 
verifies condition (1),(2),(3),(4) for some graph 21 (and hence returns k + 1), then F 
also verifies this condition for 21 on input n + 1, and returns k + 1 as well. Otherwise, 
f(n) = k, and also here f(n + 1) > f(n) holds. □ 

Lemma 11.3.3. For all no there is n > no such that f{n) > /(no) (unless coNP ^ 

P)- 



Note that by the fixpoint theorem of recursion theory we can assume that F has access to its 
own description. 
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Proof. Assume for contradiction that there exists an n such that f(n) equals 
a constant k$ for all n > hq. Then there also exists an ri\ such that for all n > ri\ 
the value of k computed by the first phase of F on input n is ko- 

If fco is even, then on all inputs n > ?ii the second phase of F simulates M^ /2 
on encodings of an enumeration of graphs. Since the output of F must be ko, for all 
graphs neither (1) nor (2) can apply. Since this holds for all n > n\, the polynomial- 
time bounded machine M ko / 2 correctly decides CSP(*B7- ), and hence CSP(©^) is in 
P. But then there is the following polynomial-time algorithm that solves CSP(Q3-7-), 



a contradiction to coNP-completcness of CSP(*8-7-) (Proposition 11.3.1) and our as- 
sumption that coNP 7^ P. 



Input: A directed graph 21. 

Test whether 21 homomorphically maps to . 
If yes, accept. 

If no, test whether one of the finitely many graphs in T \ To embeds into 21. 
Accept if none of them embeds into 21. 
Reject otherwise. 



If ko is odd, then on all inputs n > n\ the second phase of F does not find a graph 
21 for which (3) or (4) applies, because the output of F must be ko- Hence, R\ k /2\ is 



a polynomial-time reduction from CSP(Q3-7-) to CSP(*B"7^), and by Proposition 11.3.1 
the problem CSP(Q3-7- ) is coNP-hard. But note that because f(n) equals the odd 
number k for all but finitely many n, the set To is finite. Therefore, CSP(<8-7- ) can 
be solved in polynomial time, contradicting our assumption that coNP =/= P. □ 

Theorem 11.3.4. CSP(Q3-7- ) is in coNP, but neither in P nor coNP- complete 
(unless coNP=P). 

Proof. It is easy to see that CSP(Q3-7- ) is in coNP. On input 21 the algorithm 
non-deterministically chooses a sequence of I vertices, and checks in polynomial time 
whether this sequence induces a copy of TJ. If yes, the algorithm computes f{l), which 
can be done in linear time by executing F on the unary representation of I. If f(l) 
is even, the algorithm accepts. Recall that 21 does not homomorphically map to OS-To 
iff a tournament 7] € To embeds into 21, which is the case iff there is an accepting 
computation path for the above non-deterministic algorithm. 

Suppose that CSP(25-7- ) is in P. Then for some i the machine Mi decides 



CSP(5$7- )- By Lemma 11.3.2 and Lemma 11.3.3 there exists an no such that f(no) 



2i. Then there must also be an n\ > n 2 such that the value k computed during the 
first phase of F on input n\ equals 2i. Since Mi correctly decides CSP(5$7- ), the 
machine F returns 2i on input n\. By Lemma |ll.3.2| the machine F also returns 2i 
for all inputs from n\ to ri2, and by induction it follows that it F returns 2i for all 



inputs larger than n > no, in contradiction to Lemma 11.3.3 

Finally, suppose that CSP^Xo) is coNP-complete. Then for some i the machine 
Ri is a valid reduction from CSP(Q3-7-) to CSP(Q3-7- )- Again, by Lemma 11.3.2 and 



Lemma ["ll.3.3| there exists an ri\ such that the value k computed during the first phase 
of F on input n\ equals 2i. Since the reduction Ri is correct, the machine F returns 
2i on input m, and in fact returns 2i on all inputs greater than n\. This contradicts 
Lemma 111.3.31 □ 



CHAPTER 12 



Future Work 



We conclude the thesis by mentioning four promising directions of future work. 



12.1. Phylogeny Constraints 

We have presented a classification of the complexity of CSP(Q3) for all structures 
*B with a first-order definition over (Q;<), or over the random graph (Y;E). One 
might ask which other structures, besides (Q; <) and the random graph (V; E), are 
interesting and promising candidates for such a classification. A very interesting 
candidate is the structure (L; |) (or equivalently, over a relatively 3-transitive C-set), 
introduced in Section |4~T) The class of CSPs that can be formulated with templates 
that can be defined over this structure is very large and contains many problems that 
have been independently studied in the literature, capturing for instance the rooted 
triple satisfaction problem and the quartet satisfiability problem from phylogenetic 
analysis. All the tools we needed for complexity classification are available: (L; |) is 



homogeneous, and an appropriate order expansion of it is Ramsey (see Example 8.1 



12.2. Datalog 



Feder and Vardi 89 observed that all the known algorithms for solving CSP(Q3), 
for a finite structure *B, are either based on algebraic algorithms that can be seen as 
generalizations of Gaussian elimination, or based on simple 'constraint propagation', 
or combinations of these two paradigms. This is still the case today. An elegant way 
to formalize algorithms that perform constraint propagation is Datalog. Datalog can 
be seen as conjunctive queries that have been extended by a recursion mechanisms; 
alternatively, one can view Datalog as Prolog (see e.g. 176 ) without function sym- 
bols. In the context of constraint satisfaction Datalog has been introduced in 
and further studied in 
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89 



Some of the early contributions were equivalent char- 
acterizations of the expressive power of Datalog in terms of bounded treewidth duality 
and existential pebble games. 

Recently, Barto and Kozik [12 presented an exact characterization of those CSPs 
where CSP(25) can be solved by a Datalog program. The characterization is universal- 
algebraic (see Chapter[5]), and confirming a conjecture of Larose and Zadori (140 . It 
was later shown to be equivalent to a conjecture made already by Feder and Vardi 
in 



89] , see |139 



Datalog programs are very useful to solve infinite-domain constraint satisfaction 



problems as well. It has been shown in 29 that when *B is an w-categorical structure, 



then the characterizations of the expressive power of Datalog in terms of bounded 
treewidth duality and existential pebble games remain valid. This has been applied to 
show that several fundamental infinite-domain CSPs in the literature cannot be solved 
by Datalog 39 42 . It would be very interesting to have an algebraic characterization 



of the expressive power of Datalog for CSPs with w-categorical templates. 
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12.3. Topological Clones 



We have seen in Section |7.4| that the topological automorphism group of an ui- 
categorical structure T describes T up to bi-interpretability; that is, two w-categorical 
structures T and A whose automorphism groups are isomorphic as topological groups 
are first-order bi-interpretable. 

We have also seen that the right tool for the complexity study of CSPs is primitive 
positive interpretability, and not first-order interpretability; see Section |5.5| So it 
is natural to ask in this context whether primitive positive interpretability can be 
characterized in terms of the polymorphism clone viewed as a topological clone, that 
is, viewed as an abstract clone equipped with the topology of point-wise convergence. 
We did not define abstract clones; but in this context it suffices to know that they 
relate to clones in the same way as permutation groups relate to abstract groups. 



A partial result in this direction has been obtained in joint work with Junker 36 
one of the results proven there is that two w-categorical structures without constant 
endomorphism are existential positive bi-interpretable^ if and only if their transfor- 
mation monoids, viewed as topological monoids, are isomorphic. 

The goal to lift this further to primitive positive interpretability amounts to show- 
ing that the topological clone of 58 characterizes the pseudo-variety generated by the 



polymorphism algebra of 03, via Theorem 5.5.15 



12.4. A Logic for P? 

In Section [i~4| we have seen a logical characterization of the complexity class NP: 
by Fagin's theorem, a problem is in NP if and only if it can be described in existential 
second-order logic. A similar logic for the complexity class P is not known. The 
question whether there exists a logic for P has been formalized by Gurevich |102| and 



became one of the most influential questions in finite model theory 101 

One approach to this question is to identify a fragment of existential second-order 
logic that still allows to express every problem in P, but where the question whether 
a given sentence describes a problem in P can be effectively decided. Such a logic 
would in fact give a positive answer to the problem as formulated by Gurevich. This 
approach motivates the study of expressive fragments of existential second-order logic, 
and it also motivates complexity classification for those fragments, with the eventual 
goal that the classification is sufficiently explicit so that the problem whether a given 
sentence from the logic describes a problem in P is decidable. Since the constructions 
in the proofs of Theorem |11.2.1| and Corollary |11.2.3| are effective, it can be shown 
that an algorithm that decides whether a given connected monotone SNP sentence 
describes a problem in P (note that every such problem is a CSP) would imply a 
positive answer to Gurevich's question. Such an algorithm probably doesn't exist. 
But it might exist for fragments of connected monotone SNP. Hence, further progress 
in complexity classification for infinite domain constraint satisfaction can also be 
motivated by the quest for a logic for P. 



A first-order interpretation is called existential positive if all the involved formulas of the 
interpretation are existential positive; existential positive bi-interpretations are defined by a similar 
modification of first-order bi-interpretations. 
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(with Barnaby Martin and Martin Hils), accepted for publication in Logical 
Methods in Computer Science (LMCS), arXiv:0909.5097, 2011. 

(2) "The Complexity of Rooted Phylogeny Problems" (with Jens Miiller). Log- 
ical Methods in Computer Science (LMCS), 7 (4:6), 2011. 

(3) "Boltzmann Samplers, Polya Theory, and Cycle Pointing" (with Eric Fusy, 
Mihyun Kang and Stefan Vigerske), Siam Journal on Computing (SICOMP), 
40(3): 721-769, 2011. 

(4) "H -categorical structures: Endomorphisms and Interpretations (with Markus 
Junker). Algebra Universalis, Volume 64, Numbers 3-4, 403-417, 2011. 

(5) "Locally Consistent Constraint Satisfaction Problems" (with Daniel Krai). 
Siam Journal on Discrete Mathematics (SIDMA), 40(3): 721-769, 2011. 

(6) "Horn versus full first-order: complexity dichotomies in algebraic constraint 
satisfaction" (with Peter Jonsson and Timo von Oertzen), to appear in Jour- 
nal on Logic and Computation (JLC), 2011. 

(7) "The Complexity of Temporal Constraint Satisfaction Problems" (with Jan 
Kara). Journal of the ACM (JACM), 57(2), 2010. 

(8) "Quantified Equality Constraints" (with Hubie Chen). Siam Journal on 
Computing (SICOMP), 39(8): 3682-3699, 2010. 

(9) "The Reducts of Equality (up to primitive positive inter-definability)" (with 
Hubie Chen and Michael Pinsker). Journal of Symbolic Logic (JSL), 75(4): 
1249-1292, 2010. 

(10) "Peek Arc Consistency" (with Hubie Chen). Theoretical Computer Science 
(TCS), 411(2): 445-453, 2010. 

(11) "A Fast Algorithm and Datalog Inexpressibility for Temporal Reasoning" 
(with Jan Kara), A CM Transactions on Computational Logic (TOCL), 
11(3), 2010. 

(12) "Maximal Infinite- valued Constraint Satisfaction Problems" (with Jan Kara, 
Hubie Chen, and Timo von Oertzen), Theoretical Computer Science (TCS) 
410, pages 1684-1693, 2009. 

(13) "Relatively quantified constraint satisfaction" (with Hubie Chen), Constraints, 
Volume 14, Issue 1, 2009. 

(14) "Integer Programming with 2-variable equalities and 1-variable inequalities" 
(with Gustav Nordh and Timo von Oertzen), Information Processing Let- 
ters, 109(11), pages 572-575, 2009. 
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(15) "Qualitative Temporal and Spatial Reasoning Revisited" (with Hubie Chen). 
Journal of Logic and Computation (JLC), 19(6): 1359-1383, 2009. 

(16) "The Complexity of Equality Constraint Languages" (with Jan Kara). The- 
ory of Computing Systems (TOCS), 43(2): 136-158, 2008. DOI 10.1007/s00224- 
007-9083-9. 

(17) "Generating Unlabeled Cubic Planar Graphs Uniformly at Random" (with 
Clemens Gropl and Mihyun Kang). Random Structures and Algorithms 
(RSA), 34: 157-180, 2008. 

(18) "Determining the Consistency of Partial Tree Descriptions" (with Martin 
Kutz), Artificial Intelligence (AI), 171: 185-196, 2007. 

(19) "Cores of Countably Categorical Structures". Logical Methods in Computer 
Science (LMCS), 2007. DOI: 10.2168/LMCS-3(1:2). 

(20) "Random Cubic Planar Graphs" (with Mihyun Kang, Mike Loffler and Colin 
McDiarmid). Random Structures and Algorithms (RSA), 30: 78-94, 2007. 
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Mihyun Kang and Marc Noy), European Journal on Combinatorics, 28: 
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125, 2007. 

(23) "A Direct Decomposition of 3-connected Planar Graphs" (joint work with 
Clemens Gropl, Daniel Johannsen, and Mihyun Kang). Seminaire Lotharingicn 
de Combinatoire (SLC), 54A: article B54Ak, 2007. 

(24) "Generating Labeled Planar Graphs Uniformly at Random" (with Clemens 
Gropl and Mihyun Kang). Theoretical Computer Science (TCS), 379(3): 
377-386, 2007. 

(25) "Enumeration of Unlabeled Outerplanar Graphs" (with Eric Fusy, Mihyun 
Kang and Stefan Vigerske), Electronic Journal of Combinatorics, 14, R66, 
2007. 

(26) "Constraint Satisfaction with Countable Homogeneous Templates" (with 
Jaroslav Nesetfil), Journal of Logic and Computation (JLC), 16(3): 359- 
373, 2006. 

(27) "Generating Outerplanar Graphs Uniformly at Random" (with Mihyun Kang) , 
Combinatorics, Probability and Computing (CPC), 15: 333-343, 2006. 
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(1) "Schaefer's Theorem for Graphs" (with Michael Pinsker), in the proceedings 
of the 43rd Annual ACM Symposium on Theory of Computing (STOC'll), 
San Diego, 2011. 

(2) "Decidability of Definability" (with Michael Pinsker and Todor Tsankov), in 
the proceedings of the Symposium on Logic in Computer Science (LICS'll), 
Toronto, 2011. 

(3) "Tractable Set Constraints" (with Martin Hils and Alex Krimkevitch) , in the 
proceedings of the International Joint Conferences on Artificial Intelligence 
(IJCAI'll), Barcelona, 2011. 

(4) "RCC8 is tractable on instances of bounded treewidth" (with Stefan Worn), 
in the proceedings of the International Joint Conferences on Artificial In- 
telligence (IJCAI'll), Barcelona, 2011. 
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(5) "Distance Constraint Satisfaction Problems" (with Victor Dalmau, Barnaby 
Martin, and Michael Pinsker), Proceedings of the 5th International Sym- 
posium on Mathematical Foundations of Computer Science (MFCS'10), 
162-173, arXiv:1004.3842, 2010. 
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Fall 2011: L'algorithmique et complexite de CSP, course at the MPRI (ad- 

vanced master program Paris), with Miki Hcrrman, partially in 
English. 

Fall 2010: L'algorithmique et complexite de CSP, course at the MPRI (ad- 

vanced master program Paris), with Miki Hcrrman, partially in 
English. 

Fall 2009: L 'algorithmique et complexite de CSP, course at the MPRI (ad- 

vanced master program Paris), with Miki Herrman, partially in 
English. 

Fall 2008: L'algoritmique et complexite de CSP, course at the MPRI (ad- 

vanced master program Paris), with Miki Hcrrman, partially in 
English. 

Fall 2007: Graphen und Algorithmen 1, Humboldt Universitat zu Berlin, 

tutorial and exercises, in German. 

Theoretische Informatik 2, Humboldt Universitat zu Berlin, tu- 
torial, in German. 

Summer 2007: Complexity of Constraint Satisfaction, Course at the 19th Euro- 

pean Summer School in Logic, Language and Information (ESS- 
LLI'07), with Hubic Chen, in English. 

Spring 2007: Graphen und Algorithmen II, Humboldt Universitat zu Berlin, 

lecture, in German. 

Fall 2006: Theoretische Informatik II, Humboldt Universitat zu Berlin, tu- 

torial and exercises, in German. 

Spring 2006: Constraint Satisfaction, Humboldt Universitat zu Berlin, semi- 

nar, in German. 

Theoretische Informatik III, Humboldt Universitat zu Berlin, tu- 
torial, in German. 

Fall 2005: Theoretische Informatik II, Humboldt Universitat zu Berlin, tu- 

torial, in German. 

Spring 2005: Combinatorics and its Applications, Humboldt Universitat zu 

Berlin, lecture, with Mihyun Kang, in English. 

Fall 2004: Graphen und Algorithmen I, Humboldt Universitat zu Berlin, 

tutorial and exercises, in German. 
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Spring 2004: Enumerative Combinatorics and Generating Functions, Hum- 

boldt Universitat zu Berlin, seminar, with Mihyun Kang, in Eng- 
lish. 

Spring 2003: The Strange Logic of Random Graphs, Humboldt Universitat zu 

Berlin, seminar, with Mihyun Kang, in English. 

Fall 2000: Teaching assistant for Algorithmen und Datenstrukturen, Prof. 

Dr. Kurt Mchlhorn, Universitat des Saarlandes. 

Spring 2000: Teaching assistant for Logik, Semantik und Verifikation, Prof. 

Dr. Gert Smolka, Universitat des Saarlandes. 

Fall 1999: Teaching assistant for Formal languages, Prof. Dr. Giinter Hotz, 

Universitat des Saarlandes. 

Spring 1999: Teaching assistant for Algorithmen und Datenstrukturen, Prof. 

Dr. Raimund Scidcl, Universitat des Saarlandes. 



